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A study  of  transcriptional  initiation  of  sorghum  mitochondrial  DNA  was  conducted, 
including  an  examination  of  four  sorghum  mitochondrial  genes  ( atp6-l , atp6-2,  atp9  and 
urf209 ) and  two  open  reading  frames  associated  with  cytoplasmic  male-sterility  in  sorghum 
(orf265  and  orfl07).  Transcriptional  initiation  sites  were  determined  by  primer  extension 
analyses  and  in  vitro  capping  with  guanylyltransferase  in  combination  with  ribonuclease 
protection.  Nine  transcriptional  initiation  regions  were  identified,  including  two  regions 
for  orf265  and  three  regions  for  orfl07.  In  addition  to  a distal  promoter,  orf265  has  a 
second  initiation  region  that  resides  in  a maize-like  sequence,  and  shares  a promoter 
homologous  to  that  of  T-urfl3  in  maize.  The  transcripts  of  atp9  have  two  origins.  The 
gene  is  cotranscribed  with  rmfM,  which  is  323  bp  5'  to  atp9.  Processing  of  the  precursor 
transcript  at  the  3'  terminus  of  rmfM  results  in  5'  processed  transcript  termini  of  atp9. 
Unusually,  transcripts  with  termini  3'  to  frrcfM  are  also  suitable  substrates  for 
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guanylyltransferase,  suggesting  that  a transcriptional  initiation  region  of  atp9  exists  3 to 
rrafM.  This  novel  observation  suggests  that  as  few  as  21  bp  5'  to  the  transcription 
initiation  site  of  atp9  may  be  sufficient  for  recognition  by  RNA  polymerase.  Alternatively, 
part  of  the  tmfM  coding  region  may  be  involved  as  a promoter  of  atp9  and  play  a role  in  the 
initiation  of  transcription. 

Sequences  surrounding  the  nine  transcription  initiation  sites  were  compared.  Five 
of  the  nine  initiation  regions,  including  initiation  regions  of  obligate  genes  atp6-2  and  atp9, 
contain  the  proposed  YRTA  element  found  in  other  plant  mitochondrial  systems.  Repeated 
elements  G(AyT)3-4  or  C(A/T)3-4,  present  just  upstream  of  the  transcription  initiation  sites, 
as  found  in  several  plant  systems,  were  observed  for  some  sorghum  genes  in  this  study. 

The  transcription  initiation  region  of  atp6-2  shares  high  similarity  with  a maize  atp6 
promoter,  and  was  tested  in  a heterologous  in  vitro  transcription  system  with  partially 
purified  maize  mitochondrial  RNA  polymerase.  Transcription  of  atp6-2  initiated  at  a 
predicted  site. 
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LITERATURE  REVIEW 


Introduction 

Mitochondria,  as  a site  of  energy  metabolism,  contribute  uniquely  to  a cell  among 
cellular  organelles.  In  addition  to  the  principal  function  of  oxidative  phosphorylation, 
mitochondria  are  also  involved  in  the  biosynthesis  of  numerous  key  intermediates  in  cellular 
metabolism,  such  as  pyrimidines,  amino  acids,  nucleotides,  phospholipids,  folate 
coenzymes,  urea,  heme,  and  contribute  to  import  and  localize  proteins,  phospholipids  and  a 
few  tRNAs  to  the  proper  positions  (Attardi  and  Schatz  1988). 

Like  chloroplasts,  mitochondria  have  their  own  genomes  and  possess  a separate 
enzymatic  machinery  to  express  their  genetic  information  into  functional  proteins  (Tzagoloff 
and  Myers  1986).  The  mitochondrial  energy-generating  pathway  is  composed  of  five 
enzyme  complexes  (NADH  dehydrogenase,  succinate  dehydrogenase,  cytochrome  c 
reductase,  cytochrome  c oxidase  and  ATP  synthase).  Apart  from  the  conventional 
phosphorylating  pathways,  mitochondria  of  all  plant  species  examined  have  a 
nonphosphorylating  alternative  pathway  (Day  et  al.  1980,  Siedow  and  Berthold  1986).  In 
general,  mitochondrial  genomes  may  contain  genes  coding  for  several  subunits  of  Complex 
I ( nadl , nad2,  nad3,  nad4,  nad4L,  nad5,  nad6,  and  nad7),  one  gene  coding  for  one  subunit 
of  Complex  III  ( cob ),  three  genes  coding  for  the  three  largest  subunits  of  complex  IV 
(coxl,  cox2,  and  cox3),  several  genes  coding  for  several  subunits  of  the  ATP  synthase 
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complex  ( atpl , atp6  and  atp9),  and  some  genes  coding  for  tRNAs  as  well  as  several 
subunits  of  ribosomal  RNAs  (Bonen  and  Brown  1993,  Tzagoloff  and  Myers  1986). 

Mutations  in  the  mitochondrial  genome  can  be  limited  lesions  that  affect  single 
enzymatic  functions  which  result  in  mitochondrial  defects,  or  can  be  lethal  to  a cell  (Attardi 
and  Schatz  1988).  In  human  mitochondria,  mutations  of  mtDNA  resulting  in  either  amino 
acid  substitution,  nucleotide  substitution  in  tRNA  genes;  insertion-deletion  or  copy  number 
changes  of  mitochondrial  DNA  (mtDNA)  have  been  found  to  be  associated  with  some 
common  degenerative  diseases,  including  neurogenic  muscle  weakness,  ocular  myopathy, 
and  maternally  inherited  myopathy  and  cardiomyopathy  (Wallace  1992).  In  yeast, 
mutations  in  mitochondrial  DNA  can  cause  a respiratory  deficiency,  resulting  in  the  inability 
to  utilize  nonfermentable  substrates  (Costanzo  and  Fox  1990).  Plant  mitochondrial  DNA 
mutations  have  been  demonstrated  to  be  associated  with  cytoplasmic  male  sterility  (CMS) 
(Pring  and  Lonsdale  1989,  Braun  et  al.  1992),  abnormal  growth  mutants  (Newton  et  al. 
1990),  the  teosinte  cytoplasmic-associated  miniature  mutation  (Cooper  et  al.  1990)  and  a 
leaf  variegation  phenotype  in  Arabidopsis  (Martinez-Zapater  et  al.  1992).  Consequently, 
proper  function  of  mitochondria,  i.  e.  proper  gene  expression,  is  essential. 

One  of  the  major  aspects  of  study  on  mitochondrial  gene  expression  and  regulation 
is  the  characterization  of  DNA  elements  involved  in  the  transcription  initiation  of  mtDNA, 
which  in  all  cases,  except  one  (Chang  and  Clayton  1986b)  are  located  around  transcription 
initiation  sites  (Biswas  et  al.  1990,  Clayton  1991,  Rapp  and  Stem  1992).  Two  general 
approaches  have  been  widely  used  to  identify  transcript  termini  of  mitochondrial  DNA. 
First,  5'  ends  of  in  vivo  RNAs  can  be  identified  by  either  primer  extension,  SI  nuclease 
mapping,  direct  RNA  sequencing  or  in  vitro  capping  with  guanylyltransferase,  or  a 
combination  of  these  techniques.  Guanylyltransferase  catalyzes  the  incorporation  of  GMP 
from  [cc-32P]GTP  into  5'  termini  which  carry  di-  or  tri-phosphates  and  results  in  a cap 
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structure,  whereas  processed  termini,  which  contain  a 5'  monophosphate,  will  not  be  a 
suitable  template  for  the  enzyme  (Christianson  et  al  1983).  Since  mitochondrial  RNAs  are 
not  capped  in  vivo,  they  can  be  capped  in  vitro  by  this  enzyme. 

Second,  the  putative  functional  promoter  sequences  can  be  defined  by  testing  the 
cloned  mtDNA  as  templates  in  an  in  vitro  transcription  system  with  purified  homologous 
RNA  polymerase.  These  two  approaches,  especially  in  vitro  transcription  analysis,  have 
been  exploited  to  great  detail  in  studying  the  mechanisms  of  transcription  initiation  of  animal 
and  yeast  mtDNAs. 

Compared  with  their  counterparts  in  animal  and  yeast,  plant  mitochondrial  genomes 
are  much  larger  and  more  complex  (Gray  1989,  Lonsdale  and  Grienenberger  1992). 

Frequent  recombination  and  rearrangement  events  could  result  in  duplications  of  part  or 
entire  regions  of  protein-coding  genes  or  generations  of  chimeric  configurations,  including 
transcriptional  regulatory  elements.  Therefore,  it  is  expected  that  multiple  transcription 
initiation  sites  would  exist  in  plant  mtDNA  (Gray  et  al.  1992). 

Transcript-mapping  studies  have  indicated  that  there  can  be  multiple  sites  of 
transcription  initiation  in  plant  mtDNA  (Binder  and  Brennicke  1993b,  Brown  et  al.  1991, 
Covello  et  al.  1991,  Hanic-Joyce  and  Gray  1991,  Mulligan  et  al.  1988a,  1988b,  Rapp  and 
Stem  1992).  Utilizing  in  vitro  transcription  systems,  wheat  mitochondrial  cox2  (Hanic- 
Joyce  and  Gray  1991)  and  maize  atpl,  atp6  and  cox3  promoters  were  defined  (Rapp  and 
Stem  1992).  The  functional  analysis  of  the  maize  atpl  promoter  demonstrated  that  a 
conserved  1 1 nucleotide  motif  of  (A/T)CRTA(G/T)A(A/T)AAA  (R;  purine)  is  an  essential 
element  for  the  atpl  promoter  function  (Rapp  and  Stem  1992,  Rapp  et  al.  1993). 
Nevertheless,  despite  the  general  good  agreement  of  many  mapped  5’  transcript  termini  with 
the  proposed  YRTA  (corresponding  to  the  initially  proposed  CRT  A,  Y;  pyrimidine)  motif, 
the  initiation  regions  for  some  transcripts  show  little  similarity  to  the  conserved  YRTA 
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element  (Binder  et  al.  1994,  Lizama  et  al.  1994,  Mulligan  et  al.  1991).  The  uncertainty 
regarding  the  nature  of  plant  mitochondrial  promoter  emphasizes  the  need  for 
characterization  of  more  promoters  of  plant  mtDNAs. 

Mitochondrial  Genomes 

It  is  widely  accepted  that  the  mitochondrion  originated  from  alpha-purple  bacterium 
as  an  endosymbiont  (Attardi  and  Schatz  1988).  As  far  as  the  structure,  gene  organization 
and  the  mode  of  expression  are  concerned,  mitochondrial  DNA  exhibits  an  extraordinary 
diversity  in  different  organisms  of  the  eukaryotic  kingdoms  of  animals,  plants  and  fungi 
(Gray  1989).  This  diversity  probably  reflects  a polyphyletic  origin  of  past  endocytotic 
events  or  a monophyletic  origin  that  has  led  to  diversity  of  present-day  mitochondria 
through  different  evolutionary  pathways  in  different  eukaryotic  cells  (Attardi  and  Schatz 

1988). 

Animal  Mitochondrial  DNA 

The  most  intriguing  feature  of  the  vertebrate  mitochondrial  genome  is  its  extremely 
compact  organization.  With  few  exceptions,  vertebrate  mtDNAs  are  very  similar  in  size 
(16.5-17.6  kb),  and  present  as  circular  molecules  at  high  copy  number  at  a level  of  1000- 
10000  mtDNA  molecules  per  somatic  cell  (Clayton  1991).  The  genomes  have  an  invariant 
gene  order,  and  are  organized  and  expressed  in  an  essentially  identical  manner  (Anderson  et 
al.  1981).  Except  for  a short  DNA  segment  surrounding  the  origin  of  replication, 
vertebrate  mtDNA  is  saturated  by  structural  genes,  which  are  adjacent  one  to  another 
(Clayton  1984,  1991).  In  most  cases,  there  is  a tRNA  gene  between  adjacent  protein 
coding  genes,  and  all  of  the  genes  lack  introns  (Clayton  1984,  1991).  Most  of  the  genes 
are  encoded  by  the  heavy  (H)-strand,  including  14  tRNA  genes,  2 rRNA  genes  and  13 
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protein  coding  genes  that  encode  seven  subunits  of  NADH  dehydrogenase,  apocytochrome 
b of  ubiquinol  cytochrome  c reductase,  three  subunits  of  cytochrome  c oxidase  and  two 
subunits  of  ATP  synthase  ( atp6  and  atp8)  (Gray,  1989). 

While  the  little  size  variation  among  vertebrate  mtDNAs  is  mainly  confined  to  the 
noncoding  D-loop  region,  which  contains  the  origin  of  DNA  replication  as  well  as 
promoters  for  two-strand  transcription  (Clayton  1991),  the  size  variation  over  threefold 
among  invertebrate  resulted  from  absence  or  duplication  of  the  coding  and  noncoding 
regions  (Gray  1989).  In  the  nematode  worm  Ascaris  suum,  the  mitochondrial  genome 
lacks  the  atp8  gene,  and  the  genome  size  is  as  small  as  14.3  kb  (Wolstenholme  et  al. 

1987).  The  sea  scallop  Plactopecten  magellanicus  mtDNA  has  large  direct  tandem 
duplications  of  the  genome,  and  the  genome  size  extends  to  32.9  kb  (Zouros  1992). 
Mitochondrial  gene  order  varies  among  vertebrates,  insects,  echinoderms,  nematode  and 
platyhelminths  (Gray  1989).  Nevertheless,  animal  mitochondrial  genomes,  both  vertebrate 
and  invertebrate  mtDNAs,  maintain  an  extremely  compact  arrangement. 

Fungal  Mitochondrial  DNA 

The  gene  structure  and  organization  of  fungal  mtDNA,  which  varies  from  17.6  kb  to 
1 15  kb  among  ascomycetes  (Gray  1986),  contrast  remarkably  with  vertebrate  mtDNAs. 
From  observations  with  electron  microscopy,  pulsed  field  gel  electrophoresis  and 
fluorescent  microscopic  technique,  linear  DNA  molecules  were  proposed  as  the  major  form 
of  mitochondrial  genomes  in  fungi  (Bendich  1993).  The  large  size  variation  of  fungal 
mtDNAs  is  due  not  only  to  the  intergenic  spacer  length  but  also  to  the  presence  or  absence 
of  optional  introns. 

In  contrast  to  animal  mtDNAs  in  which  no  introns  have  been  found,  several  fungal 
mitochondrial  genes,  such  as  cob,  coxl  and  rm21  in  Saccharomyces  cerevisiae,  are 
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themselves  discontinuous.  Some  of  the  introns  contain  open  reading  frames  coding  for 
protein  (maturase)  involved  in  the  intron  splicing  (Costanzo  and  Fox  1990).  Other  features, 
including  duplication  of  portions  of  the  genome,  presence  of  short  repetitive  sequences  and 
the  number  of  short  repetitive  sequences,  and  presence  of  novel  open  reading  frames,  also 
contribute  to  the  size  variations  of  fungal  mitochondrial  genomes  (Gray  1989,  Tzagoloff  and 
Myers  1986). 

Generally,  fungal  mtDNA  codes  for  several  major  protein  components  of  respiratory 
chain  enzymes,  including  subunit  I,  II  and  HI  of  cytochrome  c oxidase,  apocytochrome  b, 
subunit  6,  8 and  9 (which  is  nuclear-encoded  in  animal  mtDNAs,  Tzagoloff  and  Myers 
1986)  of  ATP  synthase,  and  a full  set  of  tRNA  as  well  as  the  two  rRNAs  (Tzagoloff  and 
Myers  1986).  In  addition,  fungal  mtDNAs  encode  several  low  abundant  proteins  which 
may  play  an  important  role  in  the  transposition  of  the  introns,  such  as  reverse  transcriptase 
in  yeast  (Kennell  et  al.  1993). 

Plant  Mitochondrial  DNA 

Organization  and  genes 

Higher  plant  mitochondrial  genomes,  which  vary  in  size  from  200  kb  to 
approximately  2500  kb,  are  much  larger  and  more  complex  than  their  animal  or  fungal 
counterparts  (Lonsdale  1989,  Lonsdale  and  Grienenberger  1992).  Even  the  smallest  plant 
mitochondrial  genome,  which  is  208  kb  in  Brassica  hirta,  is  larger  than  the  largest  fungal 
mitochondrial  genome,  which  is  176  kb  in  Agaricus  bitorquis,  and  larger  than  most 
chloroplast  genomes  (Gray  1989).  While  several  unique  features  contribute  to  the  genome 
expansion,  the  most  interesting  character  is  the  relative  high  frequency  of  rearrangements 
that  involve  both  intra-  and  intermolecular  recombinations  (Lonsdale  and  Grienenberger 


7 


1992,  Pring  and  Lonsdale  1985).  Other  features  include  presence  of  repeated  sequences 
(coding  or  noncoding)  ranging  from  a few  up  to  several  hundred  base  pairs,  and  possession 
of  integrated  chloroplast  sequences  (Lonsdale  and  Grienenberger  1992,  Pring  and  Lonsdale 
1985). 

The  process  of  recombination  between  repeated  DNA  elements,  large  repeated 
sequences  as  well  as  small  nucleotide  repeats  (Andre  et  al.  1992),  results  in  either  sequence 
inversions  or  the  resolution  of  circular  molecules  into  smaller  subgenomic  species  (Lonsdale 
and  Grienenberger  1992).  This  process  could  occur  within  molecules,  as  well  as  between 
different  circular  DNA  molecules,  which  share  sequence  homology,  and  thus  give  rise 
multimeric  forms  of  individual  circle  and  more  complex  patterns  of  the  genome  (Lonsdale 
1989,  Lonsdale  and  Grienenberger  1992).  A "master  circle"  model  was  proposed  for  plant 
mitochondrial  genome,  in  which  the  entire  genetic  complexity  is  contained  in  a circular 
chromosome  that  may  be  resolved  into  subgenomic  molecules  by  recombination  through 
directly  repeated  sequences  (Lonsdale  1989,  Lonsdale  and  Grienenberger  1992,  Schuster 
and  Brennicke  1994).  Based  on  observations  of  pulsed  field  gel  electrophoresis  and 
electrophoresis  of  moving  pictures  of  DNA  released  by  lysis  of  organelles,  linear  DNA 
molecules  were  proposed  as  the  major  in  vivo  form  of  mitochondrial  genomes  in  higher 
plants  (Bendich  1993). 

Apart  from  a high-molecular- weight  DNA  component  that  represents  the  main 
genomic  DNA,  in  many  plant  species,  mitochondria  also  contain  some  smaller  circular  and 
linear  DNAs  (Lonsdale  and  Grienenberger  1992,  Pring  and  Lonsdale  1985).  While  most 
of  these  plasmid  sequences  contain  no  sequences  required  for  mitochondrial  functions,  the 
2.3  kb  linear  plasmid  of  maize  contains  the  plasmid  genes  for  trnP  and  trnW  (Leon  et  al. 
1989,  Marechal  et  al.  1987).  These  plasmid  DNAs  are  sometimes  integrated  into  the 
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genomic  mtDNAs,  and  contribute  to  the  complexity  and  large  size  of  plant  mitochondrial 

genomes  (Lonsdale  and  Grienenberger  1992). 

Promiscuous  chloroplast  DNA  sequences  are  found  in  mitochondrial  DNA  (Stem 
and  Lonsdale  1982),  contributing  to  the  large  size  of  the  genome.  While  most  of  the  plastid 
sequences  appear  to  be  nonfunctional  pieces  of  genes,  several  tRNAs  seem  to  be  transcribed 
from  sequences  of  plastid  origin  in  the  mitochondrial  compartment  (Lonsdale  1989). 

Most  of  the  standard  set  of  mitochondria  genes  found  in  mammalian  and  fungal 
mtDNAs  have  also  been  localized  in  many  plant  mitochondrial  genomes.  In  addition,  plant 
mitochondrial  genomes  have  several  genes  that  are  not  detected  in  fungal  or  animal 
mtDNAs.  These  genes  include  rm5,  atpl,  nad7  (Bonen  et  al.  1994),  nad9  (Lamattina  et  al. 
1993),  an  ABC-type  (ATP-binding  cassette)  heme  transporter  gene  (Bonnard  and 
Grienerbenger  1995,  Jekabsons  and  Schuster  1995),  rpsl2  (Gualberto  et  al.  1988)  and 
rpsl4  in  some  plant  mtDNAs  (Newton  1988). 

Unlike  mammals  (Anderson  et  al.  1981)  and  fungi  (Heckman  and  RajBhandary 
1979,  Martin  et  al.  1977),  plant  mitochondrial  genomes  do  not  code  for  sufficient  tRNAs  to 
support  protein  synthesis  (Dietrich  et  al.  1992).  Analysis  of  tRNAs  and  the  tRNA  genes 
revealed  the  tRNAs  whose  genes  are  missing  in  plant  mitochondrial  genomes  are  actually 
imported  from  the  cytosol  (Marechal-Drouard  et  al.  1988). 

A number  of  plant  mitochondrial  native  tRNA  genes  have  been  sequenced,  and  it 
was  found  that  the  plant  mitochondrial  tRNAs  resemble  eubacterial  and  chloroplast  tRNAs 
(65-80%  sequence  similarity)  rather  than  mitochondrial  tRNAs  from  fungi  or  animals  at  the 
level  of  both  primary  sequence  and  potential  secondary  structure  (Dietrich  et  al.  1992).  The 
maturation  of  the  plant  mitochondrial  tRNA  (Marchfelder  and  Brennicke  1994)  is  similar  to 
that  of  yeast  (Costanzo  and  Fox  1990),  where  5'-leader  and  3 -trailer  sequences  are 
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removed  by  the  endonuclease  RNAse  P (Altman  1981)  and  a 3'-endonuclease  respectively, 
followed  by  addition  of  CCA  at  the  3'  end  . 

In  several  plant  mitochondrial  genomes,  rmfM  or  pseudo  rmfM  is  found  5'  close  to 
protein  coding  genes  (Krishnasamy  et  al.  1993,  Sangare  et  al.  1989,  Singh  and  Brown 
1991),  such  as  atp6  (Singh  and  Brown  1991).  The  gene  coding  for  subunit  6 of  ATP 
synthase  is  located  3'  to  fmfM  in  normal  radish  and  fertile  Brassica  napus,  and  3'  to  a 
truncated  pseudo  tmfM  in  Polima  cytoplasm  of  B.  napus.  In  Ogura  cytoplasm  of  radish,  an 
open  reading  frame,  orfl38,  was  found  3'  to  tmfM.  In  maize,  one  of  the  two  copies  of 
tmfM  was  revealed  5'  to  a transcribed  protein  gene  (Sangare  et  al.  1989). 

Transcript  mapping  with  primer  extension  analysis  of  the  downstream  protein 
coding  genes  demonstrated  that  the  5’  end  of  the  shortest  transcript  of  those  genes  were 
mapped  precisely  to  the  3'  end  of  tmfM  ( Singh  and  Brown  1991).  A role  of  a tRNA  or 
tRNA-like  element  in  the  formation  of  5'  transcript  terminus  of  the  downstream-located  gene 
was  proposed,  whereby  the  tRNA  is  cotranscribed  with  the  protein  coding  gene  and  a 
precise  endonucleolytic  cleavage  of  the  precursor  results  in  a mature  3'  end  of  the  tRNA, 
and  the  5'  transcript  termini  of  the  protein  coding  gene  (Singh  and  Brown  1991). 

Introns  have  been  described  for  several  plant  mitochondrial  genes.  Among 
angiosperm  mtDNAs,  22  different  introns  (Thomson  et  al.  1994)  have  been  detected, 
including  cox2,  and  18  introns  of  genes  nadl,  nad2,  nad4,  nad5  and  nad7  (Bonen  et  al. 
1994,  Wahleithner  et  al.  1990).  These  introns,  ranging  between  700  and  3400  nucleotides, 
are  another  factor  that  enlarge  the  plant  mitochondrial  genome. 

Unlike  the  yeast  mitochondrial  genome,  in  which  no  editing  event  has  been  reported 
yet,  RNA  editing  (Covello  et  al.  1989,  Gualberto  et  al.  1989,  Hiesel  et  al.  1989)  is, 
probably,  ubiquitous  in  the  mitochondria  of  flowering  plants,  both  monocotyledonous  and 
dicotyledonous  species  (Bonnard  et  al.  1992,  Gray  et  al.  1992,  Schuster  and  Brennicke 
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1994).  Editing  occurs  predominantly  within  coding  regions,  and  most  are  C-to-U 
substitutions  (Bonnard  et  al.  1992,  Gray  et  al.  1992,  Schuster  and  Brennicke  1994). 
Transcripts  of  virtually  every  protein-coding  gene  found  in  plant  mitochondria,  except  T- 
urfl3  (Ward  and  Levings  1991)  and  atp6  in  some  lines  of  radish  (Krishnasamy  and 
Makaroff  1994),  are  edited  (Pring  et  al.  1993b)  The  net  effect  of  C-to-U  RNA  editing  in 
plant  mitochondria  is  to  make  encoded  proteins  more  similar  in  sequence  to  their  nonplant 
homologs  (Bonnard  et  al.  1992,  Gray  et  al.  1992,  Schuster  and  Brennicke  1994). 

Cytoplasmic  male  sterility 

In  addition  to  the  expected  complement  of  genes  whose  products  are  involved  in 
electron  transport  and  ATP  synthesis,  plant  mitochondria  also  contain  a large  number  of 
unidentified  open  reading  frames  and  chimeric  genes,  including  genes  associated  with 
cytoplasmic  male  sterility  in  plants  (Braun  et  al.  1992). 

Cytoplasmic  male  sterility  (CMS)  is  maternally  inherited,  results  in  pollen  abortion, 
but  does  not  affect  female  fertility  (Levings  1993).  This  trait  is  of  significant  value 
agronomically  in  hybrid  seed  production  by  eliminating  manual  emasculation.  With  few 
exceptions  (Chen  et  al.  1993),  mitochondrial  DNA  mutations  have  been  demonstrated  to  be 
associated  with  CMS  trait,  and  the  mitochondrial  genome  rearrangements  associated  with 
CMS  in  several  plant  species  have  been  identified  and  characterized  (Braun  et  al.  1992, 
Hanson  et  al.  1991,  Pring  and  Lonsdale  1989). 

The  best  studied  system  is  in  maize  Texas  cytoplasm  (cms-T).  The  cms-T 
cytoplasm  was  extensively  used  to  produce  maize  hybrids  in  the  past.  In  1969  and  1970, 
fungal  diseases,  Southern  com  leaf  blight  and  yellow  com  leaf  blight,  caused  by  Bipolaris 
maydis  race  T and  Phyllosticta  maydis,  respectively,  occurred  which  severely  blighted 
maize  carrying  cms-T  (Braun  et  al.  1992,  Hanson  et  al.  1991,  Pring  and  Lonsdale  1989). 
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Thus,  cms-T  carried  CMS  factors  as  well  as  factors  for  disease  susceptibility.  The  toxins 
produced  from  those  two  pathogens  were  especially  virulent  to  the  cms-T  maize.  An 
unusual  gene,  T-urfl3,  was  associated  with  the  CMS  and  toxin  sensitivity  traits  (Dewey  et 

al.  1986). 

T-urfl3  arose  by  rearrangements  involving  both  intramolecular  and  intermolecular 
recombinational  events  (Dewey  et  al.  1986).  The  T-urfl3  coding  region  contained 
significant  similarity  to  the  3'  flanking  and  coding  sequences  of  the  rm26  gene,  and  the  5’ 
flanking  region  of  T-urf 13  showed  high  similarity  to  the  5'  flanking  region  of  atp6  (Dewey 
et  al.  1986).  In  addition  to  the  recombinant  fragment,  the  cms-T  mitochondrial  genome 
carried  complete,  functional  copies  of  rm26  and  atp6  located  elsewhere.  An  open  reading 
frame,  designated  orf25,  was  downstream  immediately  from  T-urf  13  (Dewey  et  al.  1986). 

T-urf  13  was  expressed  in  cms-T  male-sterile  maize  as  a 13  kD  polypeptide  (Dewey 
et  al.  1987,  Wise  et  al.  1987b).  While  in  cms-T  maize  restored  to  fertility,  URF13  levels 
were  significantly  reduced  (Dewey  et  al.  1987,  Forde  et  al.  1978,  Forde  and  Leaver  1980) 
by  the  nuclear  gene  Rfl  alone  (Dewey  et  al.  1987),  although  both  Rfl  and  Rf2  were 
required  to  restore  fertility.  Rfl  affected  expression  of  T-urf  13  by  processing  some  of  the 
T-urf  13  transcripts  post-transcriptionally  (Dewey  et  al.  1987,  Kennell  et  al.  1987,  Kennell 

and  Pring  1989). 

Revertants  derived  from  tissue  culture  were  male-fertile  and  toxin-insensitive 
(Brettell  et  al.  1980,  Gengenbach  et  al.  1977,  Wise  et  al.  1987a).  Expression  of  T-urf  13  in 
heterologous  systems  of  E.  coli  (Dewey  et  al.  1988),  yeast  (Glab  et  al.  1990,  Huang  et  al. 
1990),  insect  (Korth  and  Levings  1993)  and  tobacco  (Chaumont  et  al.  1995,  von  Allmen  et 
al.  1991)  demonstrated  that  T-urf  13  was  responsible  for  the  toxin  sensitivity  and  strongly 
associated  with  cms  trait.  A model  has  been  proposed  that  URF13  forms  inner 
mitochondrial  membrane  channels  in  the  presence  of  toxin  or  anther-specific  compound(s) 
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(Braun  et  al.  1992,  Levings  1993),  resulting  in  the  permeability  of  the  inner  mitochondrial 
membrane,  and  thus  led  to  disease  sensitivity,  and  probably,  pollen  abortion  (Braun  et  al. 
1992). 

In  petunia,  one  type  of  cms  has  been  described  that  can  be  restored  by  a single 
dominant  Rf  allele.  The  cms  determinant  was  localized  to  a region  of  mtDNA  that  contained 
an  open  reading  frame,  pcf  (Boeshore  et  al.  1985,  Young  and  Hanson  et  al.  1987).  Pcf  was 
composed  of  5’  flanking  and  coding  sequences  of  atp9,  part  of  the  cox2  coding  sequences 
and  an  unidentified  open  reading  frame,  urfS  (Boeshore  et  al.  1985,  Young  and  Hanson  et 
al.  1987).  Like  the  chimeric  gene  T-urfl3  in  maize,  the  duplicated  atp9  and  cox2  were  also 
present  elsewhere  in  the  petunia  mitochondrial  genome  (Boeshore  et  al.  1985,  Young  and 
Hanson  1987).  Pcf  transcript  patterns  were  similar  between  male-sterile  lines  and  fertility 
restored  line,  thus,  the  Rf  gene  acted  at  the  protein  level,  rather  than  at  the  transcriptional 
level  (Nivison  and  Hanson  1989). 

In  radish,  male-sterile  Ogura  cytoplasm  has  been  investigated  in  detail.  An  open 
reading  frame,  orfl38  (Bonhomme  et  al.  1992,  Krishnasamy  and  Makaroff  1993), 
generated  from  multiple  rearrangements,  was  found  5'  to  orfB,  a conventional 
mitochondrial  gene  (Quagliariello  et  al.  1990).  Orfl38  is  only  found  in  Ogura  radish, 
although  sequences  similar  to  orfl38  may  be  present  in  the  normal  radish  (Krishnasamy  and 
Makaroff  1993).  Immediately  5'  to  orfl38  is  a DNA  sequence  present  5'  to  normal  radish 
atp6  (Makaroff  and  Palmer  1988)  and  the  Ogura  nad3/rpsl2.  Contrary  to  the  situation  in 
maize,  the  fertility  restoration  genes  have  no  effect  on  orfl38  transcript  patterns,  but  rather 
at  protein  level.  ORF138  is  produced  in  an  organ-specific  manner  (Krishnasamy  and 
Makaroff  1994). 

In  sorghum,  several  cms  associated  structures,  including  deletions/insertions  within 
open  reading  frames,  and  chimeric  configurations,  have  been  described  (Pring  et  al.  1995). 
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A novel  observation  in  the  sorghum  chloroplast  is  that  of  a deletion  of  55  residues  of  the 
rpoC2  gene,  coding  for  the  6-subunit  of  chloroplast  RNA  polymerase,  was  found  to  be 
correlated  with  male  sterility  in  several  male-sterile  cytoplasms  (Chen  et  al.  1993).  An 
altered  obligate  gene  product,  COXI,  of  male-sterile  cytoplasms  of  IS17218  (9E  group)  and 
IS7920C  (A4  group)  was  also  revealed  to  be  associated  with  cms  trait  in  sorghum  (Bailey- 
Serres  et  al.  1986a,  1986b,  Dixon  and  Leaver  1982).  Mitochondria  from  the  male-sterile  9E 
produced  a 42  kD  COXI  product  (Bailey-Serres  et  al.  1986a,  1986b),  while  mitochondria 
from  other  cytoplasms,  including  Milo  cytoplasm  (Pring  et  al.  1995),  synthesized  a 38  kD 
COXI  polypeptide.  A recombinational  event  at  the  3'  end  of  coxl  generated  an  303  base 
extended  open  reading  frame  for  the  9E  gene,  relative  to  coxl  in  Milo  (Bailey-Serres  et  al. 
1986b).  Two  open  reading  frames,  orf265  (Tang  et  al.  submitted)  and  orfl07  (Tang  et  al. 
in  preparation)  have  been  shown  to  be  associated  with  cms  in  the  male-sterile  IS  1 1 1 2C 
cytoplasm  of  sorghum. 

Other  chimeric  genes  or  genomic  rearrangements,  such  as  orf224/atp6  in  Polima 
Brassica  napus  (Singh  and  Brown  1991),  orf522  in  sunflower  (Kohler  et  al.  1991,  Laver  et 
al.  1991),  pvs  in  the  common  bean  Phaseolus  vulgaris  (Mackenzie  and  Chase  1990),  and 
orf79  in  rice  (Akagi  et  al.  1994),  have  also  been  demonstrated  to  be  associated  with  the  cms 

trait. 


Transcription  Initiation  in  Mitochondria 

Transcription  Initiation  of  Animal  Mitochondrial  DNA 

Transcription  of  human  mtDNA  is  initiated  from  two  major  promoters,  the  heavy 
strand  promoter  (HSP)  and  the  light  strand  promoter  (LSP)  located  in  the  D-loop  region 
(Clayton  1991).  Most  of  the  genes  are  transcribed  from  the  heavy  (H)-strand,  and  virtually 
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the  whole  mitochondrial  genome  is  transcribed.  The  DNA  sequence  and  protein  element 
requirements  for  specific  initiation  of  transcription  have  been  extensively  characterized  by 
utilizing  an  in  vitro  transcription  system  (Clayton  1991).  Each  of  the  two  promoters,  which 
spans  about  50  nucleotides,  requires  two  distinct  sequence  elements  for  activity.  Analysis 
of  deletion  study  (Chang  and  Clayton  1984),  site-specific  mutagenesis  (Hixson  et  al.  1985), 
and  linker-scanning  mutagenesis  (Topper  and  Clayton  1989)  revealed  that  one  element  of  8- 
bp  surrounding  the  transcription  initiation  site,  5'-CC(G)CCAAAA(+4)-3',  is  absolutely 
required  for  transcription  activity.  The  second  element  is  located  upstream  of  the 
transcription  initiation  site  and  is  required  for  high  levels  of  specific  initiation  (Clayton 
1991).  Thus  these  two  parts  of  DNA  elements  give  rise  the  bipartite  feature  to  human 
mitochondrial  promoters  (Jaehning  1993,  Shadel  and  Clayton  1993).  Human  mitochondrial 
extracts  have  been  partially  purified  into  two  active  fractions:  one  containing  a nonspecific 
mtRNA  polymerase  and  the  other  containing  a transcriptional  factor  (h-mtTFA),  which  is 
required  for  high  levels  of  specific  transcription  initiation  by  binding  to  the  upstream  element 
of  HSP  and  LSP  (Clayton  1991). 

The  mouse  LSP  is  similar  to  the  human  mtDNA  promoters:  A DNA  sequence 
around  the  RNA  start  site  is  crucial  for  transcription,  and  the  upstream  sequences  are 
required  for  efficient  transcription  (Chang  et  al.  1986a).  Among  the  mitochondrial 
promoters  analyzed  to  date,  the  HSP  in  mouse  mtDNA  appears  to  be  unique.  The  DNA 
sequences  required  for  transcription  are  located  at  positions  -41  to  -17,  and  the  sequences 
surrounding  transcription  initiation  site  are  not  required  for  the  specific  initiation  of 
transcription  (Chang  et  al.  1986b). 

Unlike  the  bipartite  human  mitochondrial  promoters,  the  promoter  sequences  of 
Xenopus  laevis  mtDNA  (Bogenhagen  et  al.  1988)  contain  a consensus  octanucleotide 
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sequence,  5’-ACGTTATA-3',  which  is  by  itself  sufficient  for  accurate  transcription 
initiation. 

The  overall  vertebrate  mitochondrial  transcription  machinery  is  highly  selective,  and 
the  vertebrate  mtDNA  promoters  characterized  thus  far  do  not  have  significant  sequence 
similarities  with  one  another  (Clayton  1991).  The  specificity  can  also  be  shown  by  the  fact 
that  in  heterologous  transcription  assays,  mitochondrial  RNA  polymerase  from  human  cells 
does  not  initiate  transcription  at  mouse  mtDNA  promoters  and  vice  versa  (Shadel  and 
Clayton  1993).  Consequently,  mitochondrial  promoter  sequences,  at  least  in  vertebrate 
species,  may  be  specific  to  individual  or  closely  related  species  (Shadel  and  Clayton  1993). 

Transcription  Initiation  of  Yeast  Mitochondrial  DNA 

The  number  of  promoters  of  yeast  mitochondrial  genome  have  been  determined  by 
in  vitro  capping  with  guanylyltransferase  and  direct  RNA  sequencing  (Christianson  et  al. 
1983).  Unlike  the  system  in  the  human  mitochondrial  genome,  with  two  closely  located 
promoters  (Clayton  1990),  some  20  different  primary  transcripts  were  identified  for  yeast 
mtDNA  (Christianson  et  al.  1983),  and  these  promoters  are  spread  throughout  the  genome. 
To  search  for  DNA  sequences  that  define  the  yeast  mitochondrial  promoter,  sequences 
surrounding  promoter  regions  were  analyzed  and  a highly  conserved  nine-nucleotide 
sequence,  5'-ATATAAGTA(+l)-3',  was  obvious  among  all  transcriptional  start  points 
(Christianson  et  al.  1983).  Further  analysis  of  deletion  study  and  site-directed  mutagenesis 
(Biswas  et  al.  1990)  in  an  in  vitro  transcription  system  revealed  the  critical  region  required 
for  promoter  function  to  be  certain  nucleotides  within  the  nine-nucleotide  element.  In 
general,  the  consensus  sequence  seemed  to  be  the  optimal  promoter. 

Yeast  mtRNA  polymerase  is  composed  of  a 145-kD  core  subunit  encoded  by  the 
nuclear  RP041  gene  (Greenleaf  et  al.  1986),  and  a 45  kD  specificity  factor,  mtTFB, 
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encoded  by  the  nuclear  MTF1  gene  (Costanzo  and  Fox  1990).  Analysis  of  amino  acid 
sequence  of  core  subunit  revealed  that  the  MTF1  gene  is  homologous  to  the  bacteriophage 
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T7  and  T3  RNA  polymerases  (Costanzo  and  Fox  1990).  Although  the  core  alone  can 
initiate  transcription,  specific  factors  are  required  for  specific  transcriptional  activity 
(reviewed  by  Jaehning  1993).  Mutational  analysis  of  mtTFB,  a 341 -amino-acid  protein, 
defined  two  regions  that  are  important  for  normal  function  both  in  vivo  and  in  vitro  (Jang 
and  Jaehning  1991).  The  predicted  amino  acid  sequence  of  these  two  regions  of  mtTFB 
revealed  limited  similarity  to  the  conserved  region  2 and  3 of  bacterial  sigma  factors  (Jang 
and  Jaehning  1991).  These  regions  have  been  proposed  to  be  involved  in  recognition  of  the 
bacterial  promoter  (Dombroski  et  al.  1992),  DNA  unwinding  (Waldburger  and  Succkind 
1994),  and  interactions  between  the  sigma  factors  and  the  core  RNA  polymerase  (Zhou  et 
al.  1992).  However,  mutations  in  the  sigma  crucial  promoter-contacting  region  did  not 
eliminate  the  ability  of  mtTFB  to  initiate  transcription  specifically  in  vitro  (Shadel  and 
Clayton  1995),  suggesting  a different  mechanism  of  promoter  recognition  of  yeast  RNA 
polymerase  than  that  of  bacterial  RNA  polymerases. 

Transcription  Initiation  of  Plant  Mitochondrial  DNA 

The  analysis  of  plant  mitochondrial  genomes  is  usually  much  more  complicated  than 
those  of  animal  and  yeast  systems  because  the  complex  genome  organization  and  frequent 
recombination  events  in  plant  mitochondrial  genomes,  which  may  raise  the  copy  number  of 
a gene  or  reshuffling  of  transcriptional  regulatory  elements  (Gray  et  al.  1992).  In  the  latter 
case,  a homologous  gene,  in  different  plant  mitochondrial  genomes,  may  have  diverged  5' 
or  3'  flanking  sequences,  which  may  include  transcriptional  regulatory  elements.  A loose 
consensus  sequence  motif  is  expected  for  plant  mtDNA  promoters  (Gray  et  al.  1992), 
compared  with  those  highly  conserved  motifs  for  animal  and  yeast  mtDNAs. 
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Transcript-mapping  studies  with  primer  extension,  SI  nuclease  mapping  and  in  vitro 
capping  with  guanylyltransferase,  transcription  initiation  sites  of  a number  of  maize 
mitochondrial  genes,  including  rmJ8,  rm26,  atpl,  atp6,  cox3,  atp9,  T-urfl3,  and  orf221, 
have  been  identified  (Kennell  and  Pring  1989,  McCarty  et  al.  1988,  Mulligan  et  al.  1988a, 
1988b,  1991).  Multiple  initiation  sites  have  been  detected,  such  as  cox2  and  atp9  with  as 
many  as  five  or  six  initiation  sites  (McCarty  et  al.  1988,  Mulligan  et  al.  1991).  A repeated 
segment  5'  to  T-urfl3  and  atp6  was  found  to  drive  transcription  of  both  T-urfl3  and  atp6 
(Kennell  and  Pring  1989). 

In  hope  of  identifying  DNA  motifs  that  define  maize  mitochondrial  promoters, 
sequences  surrounding  the  transcription  initiation  sites  were  compared  for  shared  elements. 
A consensus  sequence,  (AyT)CRTA(G/T)A(A/T)AAA  was  revealed,  and  a sequence  of 
CRT  A is  the  most  highly  conserved  feature  of  transcription  initiation  regions  (Mulligan  et  al. 
1991).  In  addition,  the  repeated  motif  G(A/T)3-4  or  C(A/T)3-4  is  found  upstream  of  this 
element  in  some  genes  (Mulligan  et  al.  1991).  However,  DNA  sequences  of  some  initiation 
regions,  such  as  five  initiation  sites  of  atp9,  do  not  fit  this  proposed  motif,  even  within  the 
core  CRTA  region  (Mulligan  et  al.  1991). 

Similarly,  several  5'  transcript  termini  of  wheat  mitochondrial  genes  ( atpl/atp9 , 
cox2,  cox3,  and  orf25)  were  also  identified  by  isolating  and  directly  sequencing  capped 
transcripts  (Covello  et  al.  1991).  A consensus  sequence,  GCRTAtA,  with  the  initiation  site 
2-3  nucleotides  downstream  of  the  final  A,  was  defined  (Covello  et  al.  1991). 

Transcription  initiation  sites  of  a rice  gene  cluster,  rps3-rpll6-nad3-rpsl2,  were  also 
determined  (Nakazono  et  al.  1995a).  Two  initiation  sites  were  identified:  one  was  located 
between  rpll6  and  nad3 , and  the  other  one  was  mapped  upstream  of  rps3  (Nakazono  et  al. 
1995a).  Alignment  of  the  nucleotide  sequences  around  the  transcription  initiation  sites  of 
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these  genes,  as  well  as  other  rice  genes  ( orf483 , atp9  and  cob),  revealed  a conserved  CRTA 
element  (Nakazono  et  al.  1995b). 

Using  similar  transcript-mapping  approaches,  a number  of  5'  primary  transcript 
termini  of  mitochondrial  genes  of  dicots  were  mapped  to  the  corresponding  DNA 
sequences.  In  soybean,  three  mitochondrial  gene,  atp9,  RNA  b/c  and  RNA  e (RNA  b,  c 
and  e are  three  species  of  soybean  mtRNA  of  approximately  150,  135  and  80  nt, 
respectively)  transcription  initiation  sites  have  been  identified  (Auchincloss  and  Brown 
1989,  Brown  et  al.  1991).  Alignment  of  initiation  sites  revealed  that  atp9  and  b/c  transcripts 
shared  the  CRTA  element,  as  proposed  for  maize  system,  whereas  e transcript  lacked  this 
conserved  motif  (Brown  et  al.  1991). 

In  Oenothera,  transcription  initiation  sites  have  been  determined  by  in  vitro  capping 
of  mtRNA  from  Oenothera  berteriana  (Binder  and  Brenmcke  1993b).  At  least  15  initiation 
sites  were  identified,  suggesting  that  the  Oenothera  mitochondrial  genome  contained  a 
minimum  number  of  15  promoters  (Binder  and  Brennicke,  1993b).  Primary  5 transcript 
termini  of  cox2,  atpl  and  an  18  S rRNA  precursor  have  also  been  determined.  Alignment 
of  the  sequences  surrounding  the  5’  transcript  termini  revealed  a conserved  sequence  block 
of  29  nt,  aAaNTtg AaAtNTCaT AaGaG AagcaaaaN  (Binder  and  Brennicke,  1993),  with  a 9- 
nt  core  (5'-CRTAaGaGA-3’)  conserved  with  soybean  transcription  initiation  sites. 

In  potato,  the  transcription  initiation  site  of  rm26  was  mapped  (Binder  et  al.  1994). 
In  contrast  to  the  situation  of  rml8  in  Oenothera  (Binder  and  Brennicke,  1993b),  the 
transcription  initiation  site  of  rm26  was  determined  to  be  the  mature  5'  end  of  rm26 
transcript  (Binder  et  al.  1994).  Thus,  in  potato  mitochondria,  the  mature  5'  end  of  the  rm.26 
is  created  by  transcription  initiation  without  any  further  5'  processing.  The  nucleotide 
sequence  surrounding  the  initiation  site  shows  only  limited  similarity  to  other  putative 
promoter  sequence  from  dicot  plant  mitochondria,  and  lacks  the  CRTA  motif  (Binder  et  al. 
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1994).  The  transcription  initiation  site  of  another  potato  gene,  atp9,  has  also  been 
identified,  and  the  sequence  alignment  revealed  that  the  atp9  promoter  does  not  contain  the 
CRTA  element  (Lizama  et  al.  1994). 

The  transcription  initiation  site  of  a tRNA  has  also  been  investigated  in  Oenothera 
(Binder  and  Brennicke  1993a).  A tRNA^e  gene  transcription  initiation  site  was 
determined  by  primer  extension  and  in  vitro  capping  experiments.  Sequences  covering  the 
transcription  start  point  contain  the  CRTA  conserved  block,  suggesting  that  tRNA,  rRNA 
and  mRNA  can  be  transcribed  from  homologous  promoters  (Binder  and  Brennicke  1993a). 

The  availability  of  in  vitro  transcription  system,  initially  established  in  wheat 
mitochondria  (Hanic- Joyce  and  Gray  1991),  enables  functional  analysis  of  plant 
mitochondrial  promoters.  With  this  system,  the  cox2  promoter  of  wheat  was  localized  to  a 
372-bp  region  spanning  positions  -56  to  -427,  within  the  sequence  GTATAGTAAGTA 
(initiation  site  underlined)  (Hanic-Joyce  and  Gray  1991).  A YRTA  (TATA)  motif  is 
apparent  immediately  5'  to  the  initiation  site. 

In  maize,  Rapp  and  Stem  (1992,  Rapp  et  al.  1993)  were  able  to  dissect  the  maize 
atpl  promoter  with  a partially  fractionated  maize  mitochondrial  extract.  An  1 1 -nucleotide 
element,  ( A/T)CRT A(G/T) A(  A/T) AAA,  surrounding  the  initiation  site,  was  identified  to  be 
indeed  critical  for  promoter  function.  However,  this  DNA  block  alone  was  insufficient  to 
direct  transcription  initiation  in  vitro.  In  contrast,  deletion  or  alteration  of  upstream 
G(A/T)3-4  repeats  had  no  effect  on  transcription  in  vitro.  Specific  transcription  initiation  of 
maize  cox3  and  atp6  were  also  obtained  in  the  in  vitro  system.  The  promoter  of  the  rml8 
gene,  however,  was  not  recognized  by  this  RNA  polymerase  preparation,  although  the 
rm!8  promoter  bore  a YRTA  (corresponding  to  the  CRTA  proposed  initially)  core  motif. 
These  ramifications  suggested  that  a more  complex  transcription  machinery  may  required  for 
faithful  initiation  of  plant  mtDNA. 
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Further  analysis  with  linker-scanning  and  point  mutagenesis  (Rapp  et  al.  1993) 
showed  that  the  atpl  promoter  consisted  of  a central  domain  extending  from  -7  to  +5,  and 
an  upstream  domain  of  1 to  3 bp,  centered  around  -11  to  -12.  The  core  region  bearing 
YRTA  corresponded  to  the  conserved  CRT  A proposed  by  Mulligan  et  al  (1991). 

A promoter  of  perennial  teosinte  cox2  gene  was  also  defined  in  an  in  vitro 
transcription  system  with  maize  extract  (Newton  et  al.  1995),  and  a conserved  YRTA  motif 
was  found  within  the  promoter  region.  Another  cox2  promoter,  which  is  nuclear  gene  Met - 
dependent  and  is  only  active  when  the  dominant  allele  of  Met  is  present,  was  not  able  to 
initiate  transcription  in  this  in  vitro  maize  system,  although  the  in  vitro  extract  was  made 
from  a maize  line  that  carried  the  dominant  Met  allele  in  the  nucleus  (Newton  et  al,  1995). 

To  date,  nothing  is  known  regarding  the  nuclear-encoded  factors  involved  in  the 
transcription  initiation  of  plant  mitochondrial  genes.  From  studies  of  animal  and  yeast 
systems  (Jaehning  1993,  Shadel  and  Clayton  1993),  plant  mitochondrial  RNA  polymerase 
is  expected  to  be  encoded  by  the  nuclear  genome.  Based  on  the  similarity  (YRTA)  in 
promoter  structure,  it  has  been  speculated  that  the  polymerase  from  fungi,  vertebrates  and 
plants  may  have  a similar  basic  structure  of  a core  polymerase  which  is  related  to  those 
utilized  by  bacteriophage  T7  and  T3,  and  an  important  specific  factor  for  site  selection 
(Jaehning  1993). 


Regulation  of  Mitochondrial  Gene  Expression 

Compared  with  the  studies  of  nuclear  gene  expression,  regulation  of  mitochondrial 
gene  expression  is  poorly  understood.  Experiments  of  incorporating  5-bromodeoxyuridine 
into  mtDNA  of  a mouse  fibroblast  cell  line  lacking  the  cytosolic  thymidine  kinase  suggested 
that  transcription  of  mtDNA  is  constitutive  and  not  rate  limiting  for  protein  synthesis  (Attardi 


21 


and  Schatz  1988).  Posttranscriptional  regulation  may  play  a major  role  in  mammalian 
mitochondria,  as  exemplified  by  the  differential  stability  of  HeLa  cell  mitochondria.  This 
differential  stability  contributes  to  the  20-  to  150-fold  higher  copy  number  of  individual 
tRNAs  relative  to  the  individual  mRNAs  (Attardi  and  Schatz  1988).  Regulation  may  also 
occur  at  the  translation  level  of  mammalian  mitochondrial  genes,  although  the  mechanism  is 
not  understood  yet. 

In  yeast  mitochondria,  regulation  of  transcriptional  initiation  plays  a major  role  in 
differential  gene  expression.  As  much  as  a 20-fold  difference  was  revealed  between  the 
strongest  and  the  weakest  promoters  (Costanzo  and  Fox  1990).  Mitochondrial  transcription 
may  be  modulated  under  different  physiological  conditions  (Costanzo  and  Fox  1990).  The 
steady  state  level  of  mitochondrial  mRNAs  in  glucose-depressed  cells  were  shown  to  be 
three-  to  six-fold  higher  than  the  mRNAs  in  repressed  cells  (Costanzo  and  Fox  1990).  Yet, 
there  is  also  evidence  of  no  decrease  in  mtDNA  transcription  during  anaerobic  growth 
(Attardi  and  Schatz  1988). 

Studies  of  differential  gene  expression  in  plant  mitochondria  have  been  conducted. 
Labeling  of  isolated  mitochondria  from  organs  of  maize  demonstrated  that  mitochondrial 
proteins  synthesized  in  organelle  differ  among  shoots,  cobs,  tassels  and  scutella  of  maize 
(Newton  and  Walbot  1985).  Expression  of  a cms-associated  gene,  orfl38,  in  fertility- 
restored  Ogura  radish  was  revealed  as  an  organ-specific  reduction  at  the  protein  level  in 
flowers  and  leaves,  but  not  roots  (Krishnasamy  and  Makaroff  1994).  Localization  of  two 
specific  mitochondria-encoded  proteins  in  petunia  demonstrated  that  mitochondria  in  the 
same  tissue  could  differentially  express  mitochondrially  encoded  proteins  (Conley  and 
Hanson  1994).  The  ATP1  protein  was  found  at  high  levels  in  sporogeneous  tissue  and 
vascular  bundle  and  at  low  levels  in  the  connective  and  subepidermal  tissues.  In  contrary, 
the  COX2  protein  was  localized  at  high  levels  in  the  connective  and  subepidermal  tissues  but 
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at  low  levels  in  sporogeneous  and  vascular  bundle  (Conley  and  Hanson  1994).  Apart  from 
the  regulation  at  the  protein  level,  differences  at  the  transcript  level  have  also  been  studied. 
Transcripts  and  protein  of  atpl  and  cms-associated  orf522  in  sunflower  are  abundant  in 
young  meiocytes  of  cells  and  tapetal  cells,  and  were  reduced  in  the  presence  of  fertility 
restoration  genes  (Smart  et  al.  1994).  Nevertheless,  there  is  no  evidence  to  date  that 
differences  at  the  transcript  or  protein  levels  are  correlated  with  specific  promoters. 


MATERIALS  AND  METHODS 


Genetic  Stocks 

Sorghum  lines  Tx398,  IS1 1 12C  and  A3Tx398  were  provided  by  Dr.  K.  F. 

Schertz,  USDA-ARS,  College  Station,  Texas,  and  were  maintained  by  Dr.  D.  R.  Pring. 
Tx398  is  a normal  male-fertile  line.  IS1 1 12C  is  a male-fertile  line,  carrying  the  male-sterile 
IS1112C  cytoplasm.  A3Tx398  is  male-sterile  line,  carrying  the  IS1 1 12C  cytoplasm  in  the 
Tx398  nuclear  background. 

The  maize  hybrid  line  was  provided  by  Pioneer  Hybrid  International,  Johnston, 

Iowa. 


Isolation  of  Mitochondrial  DNA.  RNA  and  Protein 


Isolation  of  Mitochondria 

Mitochondria  were  prepared  from  five  to  seven-day-old  etiolated  coleoptiles  of 
sorghum  or  maize  as  previously  described  (Chase  and  Pring,  1986).  Tissues  were  ground 
in  a mortar  and  pestle  or  a Waring  blender  with  10  ml/gm  tissue  or  5 ml/gm  tissue  (for 
RNA  isolation)  of  ice-cold  buffer  A [0.5  M sucrose,  5 mM  Na2EDTA,  0.1%  BSA,  2 mM 
mercaptoethanol,  5 mM  DIECA  (for  DNA  isolation),  50  mM  Tris,  pH7.5],  and  filtered 
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through  two  layers  of  cheesecloth  and  two  layers  of  miracloth.  The  filtrate  was  centrifuged 
at  1000  g for  10  minutes  at  4C  to  remove  cell  debris  and  nuclei,  and  the  supernatant  was 
transferred  to  a clean  tube  and  centrifuged  at  15,900  g for  20  minutes  at  4C  to  pellet 
mitochondria. 

Isolation  of  mitochondrial  DNA 

Mitochondrial  DNA  was  isolated  using  the  procedure  described  by  Chase  and  Pring 
(1986).  After  mitochondrial  pellets  were  prepared,  they  were  resuspended  in  45  ml/ 10-30 
gm  tissue  of  ice-cold  buffer  G (0.3  M sucrose,  50  mM  Tris,  pH7.5)  and  centrifuged  at 
1000  g for  10  minutes  at  4C.  The  supernatant  was  transferred  to  a clean  tube  and 
centrifuged  at  15,900  g for  20  minutes  at  4C.  The  pellet  was  resuspended  in  ice-cold 
buffer  G with  a small  brush  to  10  ml/50  gm  tissue.  Magnesium  chloride  and 
deoxynuclease  were  then  added  to  the  final  concentration  of  10  mM  and  20  pg/ml, 
respectively,  and  the  sample  was  placed  at  4C  for  60  minutes.  At  the  end  of  incubation,  the 
sample  was  diluted  to  20  ml  with  ice-cold  buffer  G,  underlaid  with  20  ml  of  shelf  buffer 
(0.6  M sucrose,  20  mM  Na2EDTA,  10  mM  Tris,  pH7.2)  and  centrifuged  at  15,900  g for 
20  minutes  at  4C.  The  pellet  was  resuspended  in  shelf  buffer  to  10  ml/50-100  gm  tissue, 
and  centrifuged  at  15,900  g for  10  minutes  at  4C.  The  mitochondrial  pellet  was 
resuspended  in  2.5  ml/50  gm  tissue  of  lysis  buffer  (50  mM  Tris,  pH8.0,  20  mM 
Na2EDTA).  Sodium  dodecyl  sulfate  and  Proteinase  K were  added  to  0.5%  and  100 
(Xg/ml,  respectively.  The  sample  was  mixed  gently  and  incubated  at  37C  for  60  minutes. 

At  the  end  of  the  incubation,  a 0.026  volume  of  8 M ammonium  acetate  was  added  and  the 
mixture  was  extracted  twice  with  an  equal  volume  of  75%  water-saturated  phenol,  25% 
chloroform-isoamyl  alcohol  (24:1)  and  once  with  equal  volume  chloroform-isoamyl  alcohol 
(24:1)  by  emulsifying  thoroughly  and  centrifuging  at  7,000  g for  15  minutes.  The  upper 
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aqueous  layer  was  collected  and  the  DNA  was  precipitated  with  0. 1 volume  of  8 M 
ammonium  acetate  and  two  volumes  of  absolute  ethanol  at  -80C  for  60  minutes.  The  pellet 
was  collected  by  centrifuging  at  15,900  g for  30  minutes  at  4C,  rinsed  twice  with  absolute 
ethanol  by  centrifuging  for  10  minutes,  and  drained  dry.  The  DNA  was  resuspended  in 
200  (j.1/50  gm  tissue  of  sterile  distilled  H2O,  and  10  pi  of  ribonuclease  mixture  (1.02  mg/ml 
RNase  A,  10,000  U/ml  RNase  Tl,  boiled  for  5 minutes)  was  added  and  incubated  at  37C 
for  30  minutes. 

Isolation  of  Mitochondrial  RNA  via  Guanidine  Thiocyanate  and  LiC] 

Mitochondria  were  prepared  as  described  above,  and  mitochondrial  RNA  (mtRNA) 
was  isolated  by  the  method  of  Kennell  and  Pring  (1989).  The  mitochondrial  pellet  was 
resuspended  in  10  ml  buffer  A,  underlayed  with  20  ml  shelf  buffer  and  centrifuged  at 
15,900  g for  20  minutes.  The  pellet  was  resuspended  in  1-2  ml  of  buffer  A and  a 1.5  ml  of 
guanidine  thiocyanate  lysis  solution  (6  M guanidine  thiocyanate,  50  mM  Tris,  pH7.6,  10 
mM  Na2EDTA  and  2%  N-Lauroyl  sarcosine)  was  added  to  the  sample.  The  lysate  was 
dispersed  with  a brush,  and  0.026  volume  of  8 M ammonium  acetate  was  added.  The 
sample  was  extracted  twice  with  equal  volume  of  75%  water-saturated  phenol,  25% 
chloroform-isoamyl  alcohol  (24:1)  and  once  with  equal  volume  of  chloroform-isoamyl 
alcohol  (24: 1).  The  upper  aqueous  layer  was  collected,  brought  to  5 ml/ 100  gm  tissue  with 
buffer  A,  and  a 0.2  volume  of  sterile  12  M LiCl  was  added,  and  incubated  overnight  at  0C. 
The  sample  was  centrifuged  at  15,900  g for  20  minutes  at  4C,  and  the  pellet  was 
resuspended  in  5 ml/100  gm  tissue  of  sterile  distilled  H2O.  A 0.2  volume  of  sterile  12  M 
LiCl  was  added  and  incubated  overnight  at  0C.  The  sample  was  centrifuged  at  15,900  g 
for  20  minutes  and  the  pellet  was  resuspended  in  200  pi/ 100  g tissue  of  sterile  distilled 
H2O. 
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Isolation  of  Mitochondrial  RNA  via  Percoll  Gradients 

Mitochondria  were  prepared  as  described  above  except  2 mM  mercaptoethanol  was 
replaced  with  4 mM  dithiothreitol  (DTT).  The  pellet  was  resuspended  in  ice-cold  0.75  X 
mitochondrial  wash  buffer  (IX:  0.25  M sucrose,  0.5  mM  Na2EDTA,  15%  glycerol,  1 
mM  DTT,  20  mM  Tris,  pH7.5)  to  a final  volume  of  6 ml/100  gm  tissue.  Percoll  gradients 
(Neuburger  et  al.  1982)  were  prepared  with  45%,  21%  and  13.5%  Percoll  solutions  (in  0.5 
M sucrose,  2 mM  Na2EDTA,  0.4%  BSA,  2 mM  DTT  and  20  mM  Tris,  pH7.5)  and  kept  at 
4C.  The  mitochondria  were  carefully  layered  onto  the  Percoll  gradients  and  centrifuged  at 
23,500  rpm  in  a Beckman  SW27  rotor  for  30  minutes  at  4C.  The  materials  above  the  45- 
21%  interface  were  removed,  the  mitochondria  from  the  45-21%  interface  were  collected, 
and  diluted  with  a total  of  10  volumes  of  0.75  X mitochondrial  wash  buffer  and  centrifuged 
at  15,900  g for  20  minutes  at  4C.  The  pellet  was  collected,  diluted  with  5 volumes  of  0.5 
X mitochondrial  wash  buffer,  and  centrifuged  at  15,900  g for  20  minutes.  The 
mitochondrial  pellet  was  collected,  resuspended  in  3-4  ml  of  0.5  X mitochondrial  wash 
buffer,  transferred  to  a 1.5  ml  Eppendorf  tube,  and  centrifuged  at  16,000  g for  5 minutes. 

The  mitochondrial  pellet  was  resuspended  in  2 ml  GTC  lysis  buffer  (4  M 
guanidium  thiocyanate,  0.5%  sarkosyl,  0.1  M mercaptoethanol,  25  mM  sodium  citrate, 
pH7.0)  with  a brush,  and  allowed  to  incubate  at  25C  for  5 minutes.  A 0. 1 volume  of  2 M 
sodium  acetate  (pH4.0),  1 volume  of  water-saturated  phenol  and  0.2  volume  of 
chloroform-isoamyl  alcohol  (49:1)  were  added,  emulsified  thoroughly,  and  placed  on  ice 
for  15  minutes.  The  sample  was  centrifuged  at  7,000  g for  10  minutes.  The  upper 
aqueous  layer  was  collected,  and  the  RNA  was  precipitated  with  one  volume  of 
isopropanol  at  -80C  for  60  minutes.  The  pellet  was  collecting  by  centrifuging  at  15,900  g 
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for  20  minutes,  rinsed  twice  with  absolute  ethanol,  and  drained  dry.  The  dried  RNA  was 
resuspended  in  200  pi  sterile  distilled  H2O  /100  gm  tissue. 

Isolation  of  Mitochondria  for  Protein  Purification 

Mitochondria  were  prepared  as  described  above  except  with  modified  buffer  A (0.5 
M sucrose,  50  mM  3-[N-Morpholino]propane-sulfonic  acid  (MOPS),  0.1%  BSA,  4mM 
DTT,  10  mM  EGTA,  pH7.5)  and  modified  Percoll  solution  (in  0.5M  sucrose,  2 mM 
EGTA,  0.4%  BSA,  2 mM  DTT,  20  mM  MOPS,  pH7.5).  After  mitochondria  were  pelleted 
in  a 1.5  ml  Eppendorf  tube,  the  pellet  was  quickly  frozen  in  liquid  nitrogen  and  placed  at 
-80C. 


Recombinant  DNA  Methods 


Cloning 

In  this  study,  plasmids  pUC19  and  pBSSK(-)  were  utilized  as  vectors  for  cloning. 
The  vector  DNAs  were  digested  with  appropriate  restriction  enzymes,  and  the  desired 
fragments  were  isolated  by  gel  electrophoresis.  One  hundred  nanograms  of  the  vector 
DNA  and  a triple  molar  amount  of  foreign  DNA  were  added  to  a ligation  mixture  of  66  mM 
Tris,  pH7.5,  5 mM  MgCl2,  1 mM  DTT,  1 mM  ATP,  and  1 unit  of  T4  DNA  ligase  (5  unit 
for  blunt  end  ligation)  (Boehringer  Mannheim  Biochemicals).  The  ligation  reactions  were 
performed  at  25C  for  16  hours.  The  ligated  DNAs  were  added  to  a tube  containing  200  pi 
competent  Escherichia  coli  cells  DH5a  (the  procedure  for  making  competent  E.  coli  is 
shown  below),  mixed  and  stored  on  ice  for  30  minutes.  The  tube  was  then  transferred  to 
42C  water  bath  for  40  seconds  and  chilled  on  ice  for  5 minutes.  An  equal  volume  of  LB 
medium  (1%  of  bacto-tryptone,  0.5%  bacto-yeast  extract,  and  1%  of  NaCl)  was  added  to 
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the  tube  and  the  tube  was  placed  at  37C  for  30  minutes.  The  transformed  competent  cells 
were  spread  onto  agar  LB  medium  with  20  p.g/ml  of  ampicillin  and  800  (ig  of  X-gal  (5- 
bromo-4-chloro-3-indolyl-B-D-galactoside),  and  incubated  at  37C  for  16  hours. 

To  make  competent  DH5a,  bacteria  were  streaked  from  a frozen  stock  (stored  at 
-80)  onto  a LB  agar  plate,  and  were  incubated  at  37C  for  16  hours.  A single  colony  was 
picked  up  from  the  plate  and  transferred  into  50  ml  of  LB  broth.  The  culture  was  incubated 
at  37C  for  about  3 hours  with  vigorous  shaking.  The  concentration  of  bacterial  cells  was 
monitored  by  measuring  the  OD600  and  did  not  exceed  10^  cells/ml.  The  cells  were  then 
transferred  aseptically  to  a sterile  tube  and  chilled  on  ice  for  10  minutes.  The  bacterial  cells 
were  recovered  by  centrifugation  at  1000  g for  10  minutes  at  4C.  The  medium  was 
discarded  and  the  pellet  was  resuspended  in  10  ml  of  ice-cold  0.1  M CaCl2  and  stored  on 
ice.  The  cells  were  recovered  by  centrifugation  at  1000  g for  10  minutes  at  4C,  and  the 
pellet  was  resuspended  in  2 ml  of  ice-cold  0. 1 M CaCl2-  At  this  stage,  the  cells  were  ready 
for  used  in  transformation.  The  cells  could  also  be  dispensed  into  aliquots  that  could  be 
frozen  at  -80C. 

After  transformation,  bacterial  colonies  that  contain  the  recombinant  plasmid  were 
identified  by  a-complementation  (Sambrook  et  al.  1989)  in  which  deletion  mutants  of  the 
operator-proximal  element  of  bacteria  lacZ  gene  in  the  bacteria  host  were  complemented  by 
B-galactosidase-negative  mutants  of  plamids  which  contain  the  intact  operator-proximal 
region.  Due  to  the  a-complementation,  the  Lac+  bacteria  produced  blue  colonies  in  the 
presence  of  X-gal  (Sambrook  et  al.  1989).  The  recombinant  plasmids,  however,  formed 
white  colonies  due  to  the  insertion  of  foreign  DNA  and  thus  were  incapable  of  a- 
complementation. 
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Recombinant  Plasmids 

The  recombinant  plasmid  clones  utilized  in  the  investigation  were  listed  in  Table  1. 


Preparation  of  Plasmids 
By  triton  lvsis  method 

Transformed  cells  were  grown  in  45  ml  LB  broth  containing  0.1  g/1  ampicillin  at 
37C  with  agitation  for  16  hours.  The  cells  were  collected  by  centrifuging  10  minutes  at 
6000  g,  and  resuspended  in  0.58  ml  ST  solution  (25%  sucrose,  50  mM  Tris,  pH8.0).  Hen 
egg  white  lysozyme  (0.195  ml  of  10  mg/ml)  was  added  to  the  tube,  vortexed,  and  allowed 
to  stand  5 minutes  on  ice,  followed  by  adding  0.475  ml  of  0.2  M Na2EDTA,  pH8.  The 
sample  was  placed  on  ice  for  5 minutes,  and  1.25  ml  Triton  lysis  solution  (0.1%  Triton  X- 
100,  50  mM  Tris,  pH8.0,  6.25  mM  Na2EDTA)  was  added.  The  sample  was  rotated 
gently,  and  placed  on  ice  for  15  minutes.  The  cells  were  pelleted  by  centrifuging  at  15,900 
g for  20  minutes  at  4C.  The  supernatant  was  collected,  and  extracted  with  phenol  and 
chloroform,  precipitated  and  dried,  as  described  above.  The  dried  DNA  was  resuspended 
with  500  p.l/50ml  culture  of  sterile  distilled  H2O,  and  a 10  (ll  RNase  mixture  (1.02  mg/ml 
RNase  A,  10,000  U/ml  RNase  Tl,  boiled  for  5 minutes)  was  added,  and  incubated  at  37C 
for  30  minutes. 

Bv  boiling  method 

E.  coli  cells  were  grown  overnight  in  TB  media  (terrific  broth,  per  liter:  12  g Bacto 
tryptone,  24  g Bacto  yeast  extract,  4 ml  glycerol,  17  m M KH2PO4,  72  mM  K2HPO4) 
(Sambrook  et  al.  1989).  On  the  following  day,  1.5  ml  saturated  culture  was  transferred  to 
a 1.5  ml  Eppendorf  tube  and  the  cells  were  pelleted  by  spinning  30  seconds  in  a microfuge 
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at  maximum  speed.  The  supernatant  was  discarded  and  an  additional  1 .5  ml  culture  was 
added  to  the  tube  and  the  cells  were  pelleted.  The  pellet  was  resuspended  thoroughly  in  a 
400  |ll  ice-cold  STET  buffer  (8%  sucrose,  0.05%  Triton  X-100,  50  mM  Na2EDTA,  50 
mM  Tris,  pH  8.0).  Four  hundred  micrograms  of  lysozyme  were  added  and  the  tube  was 
held  at  100C  for  40  seconds.  The  sample  was  centrifuged  for  15  minutes  at  16,000  g,  the 
supernatant  collected,  and  the  DNA  was  precipitated  with  0. 1 volume  8 M ammonium 
acetate  and  an  equal  volume  of  cold  isopropanol  at  -80C  for  60  minutes.  The  DNA  was 
pelleted  by  centrifuging  at  16,000  g for  20  minutes  and  rinsed  twice  with  absolute  ethanol. 
The  near-dry  DNA  was  resuspended  in  50  |il  sterile  distilled  H2O. 

DNA  and  RNA  Analysis 

Southern  blotting 

Mitochondrial  DNA  was  digested  with  restriction  endonucleases,  and  fractionated 
in  agarose  gels  in  1 X TPE  electrophoresis  buffer  (0.036  M Tris,  0.03  M NaH2PC>4,  1 
mM  Na2EDTA)  with  a constant  voltage  of  2 v/cm.  After  electrophoresis,  the  gel  was 
placed  in  ethidium  bromide  staining  solution  (EB,  0.4  |ig/ml)  for  30  minutes,  and 
photographed  with  transmitted  UV  illumination.  The  gel  was  treated  with  0.25  M HC1  for 
10  minutes,  rinsed  with  distilled  water  and  denatured  in  0.6  M NaCl  and  0.2  M NaOH  for 
30  minutes.  The  gel  was  rinsed  with  distilled  water  and  neutralized  in  3 M NaCl  and  1 M 
Tris,  pH7.5,  for  30  minutes.  Schleicher  & Schuell  nitrocellulose  membranes  (BA-S  85) 
were  prepared,  wetted  in  distilled  water,  and  placed  in  a tray  of  20  X SSC  (IX:  0.15  M 
NaCl,  15  mM  sodium  citrate)  (Sambrook  et  al.  1989)  for  5 minutes.  Three  sheets  of 
Whatman  3MM  paper  were  placed  on  a 3-cm-thick  stack  of  paper  towels,  and  one  piece  of 
nitrocellulose  membrane  was  laid  on  the  top  of  the  3MM  paper.  After  neutralization,  the 
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gel  was  placed  on  the  top  of  the  nitrocellulose  membrane  and  air  bubbles  were  removed 
with  a roller.  Another  nitrocellulose  membrane  was  removed  from  the  tray  and  laid  on  the 
top  of  the  gel.  Air  bubbles  were  squeezed  out  by  rolling  a roller  over  the  surface.  Another 
three  sheets  of  Whatman  3MM  paper  and  a 3-cm-thick  stack  of  paper  towel  were  placed  on 
the  top  of  the  filter.  The  transfer  was  allowed  to  proceed  for  three  hours,  and  the  DNA  was 
fixed  by  baking  the  filter  for  2 hours  at  80C. 

Preparation  and  labeling  of  probe  DNA 

The  probe  DNA  was  obtained  as  a cloned  fragment  which  was  purified  from  the 
vector  by  restriction  digestion  followed  by  recovery  from  an  agarose  gel  and  elution  using 
Qiaex  gel  extraction  kit  (Qiagen  Inc.)  as  per  manufacturer's  direction. 

The  DNA  was  labeled  by  random  oligonucleotide  priming  using  a random  primed 
DNA  labeling  kit  (Boehringer  Mannheim  Biochemicals).  Typically,  500  ng  of  DNA  was 
denatured  and  labeled  with  10-50  |iCi  [a-32P]dCTP  as  per  manufacturer's  direction.  The 
un-incorporated  deoxyribonucleoside  triphosphates  were  removed  by  chromatography  on  a 
Sephadex  G-50  column. 

Hybridization  of  DNA  blots 

The  immobilized  DNA  membrane  was  placed  in  a hybridization  tube  with 
hybridization  solution  [3  X SSC,  10  X Denhardt's  solution  (1  X:  0.02%  Ficoll  400, 

0.02%  polyvinylpyrrolidone  360,  0.02%  bovine  serum  albumin,  0.01%  sodium 
dodecylsulfate),  and  10  pg/ml  of  sheared  salmon  sperm  DNA]  (Chase  and  Pring  1986), 
and  incubated  for  one  hour,  with  rotation,  at  65C  in  the  hybridization  oven.  The  labeled 
DNA  was  denatured  for  10  minutes  at  100C,  placed  on  ice  for  5 minutes,  and  added  to  the 
hybridization  solution.  The  hybridization  proceeded  for  16  hours  at  65C.  After 
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hybridization,  the  membrane  was  washed  in  3 X SSC  for  5 minutes  at  65C,  and  washed 
twice  in  0.3  X SSC  for  15  minutes  each  at  65C. 

tRNAs 

Mitochondria  were  isolated  from  etiolated  coleoptiles  and  RNA  was  extracted  as 
described  above.  tRNAs  were  isolated  from  the  total  RNA  sample  by  holding  in  2 M LiCl 
at  4C  overnight  (Parks  et  al.  1984).  The  sample  was  centrifuged  at  15,900  g for  20 
minutes  at  4C  and  the  supernatant  was  collected.  The  RNAs  were  precipitated  from  the 
supernatant  by  adding  0. 1 volume  of  8 M ammonium  acetate  and  2.5  volume  of  absolute 
ethanol  and  placed  at  -80C  for  one  hour.  tRNAs  were  pelleted  by  centrifuging  at  15,900  g 
for  20  minutes,  washed  twice  with  absolute  ethanol  and  dried  under  vacuum.  The  dried 
tRNAs  were  resuspended  in  sterile  distilled  H2O. 

About  50  pg  of  the  resuspended  tRNA  was  then  treated  with  50  units  of  RNase- 
free  DNase  (Boehringer  Mannheim  Biochemicals)  in  a reaction  buffer  of  40  mM  Tris, 
pH7.5,  6 mM  MgCl2,  lOmM  NaCl  and  10  mM  DTT  at  37C  for  15  minutes.  The  sample 
was  extracted  twice  with  an  equal  volume  of  75%  water-saturated  phenol,  25%  chloroform 
and  once  with  chloroform-isoamyl  alcohol  (24:1).  The  tRNAs  were  precipitated  and  dried 
as  described  above.  The  dried  tRNAs  were  resuspended  in  sterile  distilled  H2O. 

The  resuspended  tRNAs  were  then  heated  at  55C  for  5 minutes,  placed  on  ice  for  5 
minutes,  and  treated  with  10  units  of  calf  intestine  alkaline  phosphatase  (Boehringer 
Mannheim  Biochemicals)  in  50  mM  Tris,  pH9.8,  0.1  mM  Na2EDTA,  at  50C  for  60 
minutes.  The  sample  was  extracted  with  phenol  and  chloroform,  and  dried  as  described 
above.  The  dried  RNAs  were  resuspended  in  sterile  distilled  H2O. 

The  5'  termini  of  tRNAs  were  labeled  with  10  units  of  T4  polynucleotide  kinase 
(New  England  Biolabs)  and  [y-32p]ATP  in  70  mM  Tris,  pH7.6,  10  mM  MgCl2,  5 mM 
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DTT,  40  (iCi  [y-32p]ATP  (3000  Ci/mmol)  and  80  units  of  human  placentral  RNase 
inhibitor  (RNAsin,  Promega  Corporation).  The  product  was  extracted  with  phenol  and 
chloroform,  and  dried  as  described  above. 

Polymerase  chain  reactions  (PCR) 

Polymerase  chain  reactions  were  performed  with  100  ng  of  template  DNA  and  2 
units  of  Vent  DNA  polymerase  (New  England  Biolabs)  plus  50  pmol  of  each  specified 
primer  in  100  |il  of  a solution  of  10  mM  KC1,  20  mM  Tris,  pH8.3,  10  mM  (NH4)2S04, 
200  pM  each  deoxynucleoside  triphosphates  (dNTP),  and  0.1%  Triton  X-100.  The 
reaction  mixture  were  cycled  in  a thermocycler  (Coy  Tempcycler)  for  30  cycles  (94C  1 
minute;  2 minutes  under  annealing  conditions;  and  72C,  3 minutes),  followed  by  extension 
at  72C  for  10  minutes.  The  products  were  treated  twice  with  equal  volume  of  chloroform- 
isoamylalcohol  (24:1)  and  precipitated  with  0.1  volume  of  8 M ammonium  acetate  and  2 
volumes  of  absolute  ethanol. 

For  cloning,  the  PCR  products  were  treated  with  Klenow  DNA  polymerase 
(Boehringer  Mannheim  Biochemicals)  to  generate  blunt  ends,  extracted  with  phenol  and 
chloroform,  and  dried  as  described  above.  The  resuspended  products  were  cloned  into  the 
Smal  site  of  Bluescript  SK(-)  (Stratagene)  unless  otherwise  specified. 

DNA  sequencing 

DNA  sequencing  was  carried  out  by  the  dideoxynucleotide  chain  termination 
method  (Sanger  et  al.  1977)  with  a DNA  sequencing  Kit  (United  States  Biochemical). 
Approximately  1 jig  of  single-stranded  DNA  (Ml 3)  or  3 [ig  of  double-stranded  plasmid 
DNA  was  used.  Double-stranded  DNA  was  denatured  in  0.2  M NaOH  and  0.2  mM 
Na2EDTA  at  25C  for  5 minutes,  followed  by  neutralization  with  0.1  volume  of  2 M 
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ammonium  acetate,  and  precipitated  with  0. 1 volume  of  8 M ammonium  acetate  and  2 
volumes  of  absolute  ethanol.  Annealing  was  performed  by  adding  2 pi  of  Sequenase 
reaction  buffer  and  1 pi  of  2.5  pmol  of  the  specific  primer  to  the  7 pi  of  redissolved 
denatured  plasmid  DNA  or  1 |il  of  0.5  pmol  of  Ml 3 specific  primer,  -40,  and  5 pi  M13 
DNA  with  2 pi  H2O.  The  mixture  was  heated  to  65C  for  2 minutes,  and  was  placed  at 
room  temperature  for  20  minutes  period.  The  following  reagents  were  added  to  the  ice- 
cold  annealed  DNA  mixture:  1 pi  of  0.1  M DTT,  2 pi  of  labeling  mixture,  10  pCi  of  [a- 
35S]dATP  and  2 pi  of  Sequenase  version  2.0.  The  mixture  was  incubated  for  5 minutes  at 
25C,  and  3.5  pi  of  the  reaction  was  transferred  to  each  A,  C,  G,  T tube  containing  2.5  pi 
of  dideoxynucleotide  termination  mixture  prewarmed  at  37C.  The  incubation  was 
continued  for  another  5 minutes  at  37C  and  the  reaction  was  stopped  by  adding  4 pi  of 
termination  reaction. 

Primer  extension  analysis 

Gene-specific  synthetic  oligonucleotides  (1  pmol)  were  labeled  with  40  pCi  [y- 
32P]ATP  (3000  Ci/mmol)  in  a reaction  containing  50  mM  Tris,  pH8.5,  10  mM  MgCl2,  5 
mM  DTT,  and  10  unit  of  T4  polynucleotide  kinase  (Boehringer  Mannheim  Biochemicals)  at 
37C  for  30  minutes.  Ten  micrograms  of  mtRNA  and  0. 1 pmol  labeled  primer  were 
denatured  by  heating  to  75C  for  5 minutes,  and  annealed  at  50C  for  5 minutes.  The 
extension  was  performed  in  a reaction  of  10  mM  Tris,  pH8.5,  6 mM  MgCl2,  50  mM  KC1, 

1 mM  DTT,  1 mM  deoxynucleotide  triphosphate  mixture  (dATP,  dCTP,  dGTP,  and  dTTP) 
and  12  units  of  AMV  reverse  transcriptase  (Boehringer  Mannheim  Biochemicals)  at  42C 
for  15  minutes.  The  reaction  was  terminated  by  5 pi  of  formamide  stop  buffer,  and  the 
sample  was  boiled  for  3 minutes  prior  to  loading  on  a 6%  denaturing  polyacrylamide  gel. 

Synthetic  nucleotides  used  in  primer  extension  analysis  were  listed  in  Table  2. 
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Characterization  of  Initiated  Transcript  Termini 
In  vitro  Capping  Reaction 

One  hundred  micrograms  of  sorghum  mtRNA  extracted  from  mitochondria  purified 
through  Percoll  gradients  were  capped  in  50  mM  Tris,  pH7.9,  1.25  mM  MgCl2,  2.5  mM 
DTT,  6 mM  KC1,  80  units  of  RNase  inhibitor  (RNasin,  Promega  Corporation),  1 pM 
GTP,  250  pCi  [a-32P]GTP  (3000  Ci/mmol)  and  12  units  of  guanylyltransferase  (GIBCO 
BRL  Life  Technologies)  at  37C  for  30  minutes.  The  reaction  was  terminated  with  20  pg  of 
Proteinase  K in  100  pi  5 mM  Na2EDTA  and  0.5%  SDS  at  37C  for  15  minutes.  The 
capped  RNA  was  extracted,  precipitated  and  resuspend  in  sterile  distilled  H2O,  as 

described  above. 

Antisense  RNA 

Linearized  templates  in  pBluescript  vector  were  used  for  the  respective  genes  to 
transcribe  RNA  in  vitro.  The  transcription  reaction  was  performed  in  20  pi  of  40  mM  Tris, 
pH7.5,  6 mM  MgCl2,  2 mM  spermidine,  10  mM  NaCl,  10  mM  DTT,  20  units  of  RNase 
inhibitor  (RNasin,  Promega  Corporation),  0.5  mM  NTP  mixture  (ATP,  CTP,  GTP,  UTP), 
200-500  ng  template  DNA  and  15  units  of  T3  or  T7  RNA  polymerase  at  37C  for  30 
minutes.  The  template  DNA  was  removed  by  digestion  with  50  units  of  RNase-free  DNase 
(Boehringer  Mannheim  Biochemicals)  at  37C  for  15  minutes.  The  reaction  was  extracted, 
precipitated  and  resuspend  in  sterile  distilled  H2O. 
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Ribonuclease  Protection 

Approximately  10  pg  of  capped  sorghum  mtRNA  and  2 pg  of  antisense  RNA  were 
denatured  at  85C  for  5 minutes  in  30  pi  of  hybridization  solution  of  80%  deionized 
formamide,  40  mM  PIPES  (Piperazine-N,  N'-bis [2-ethane-sulfonic  acid]),  pH6.4,  0.4  M 
sodium  acetate,  1 mM  Na2EDTA,  and  hybridization  was  allowed  to  proceed  at  37C  for  16 
hours.  Three  hundred  ul  of  RNase  mixture  was  added  which  contained  20  mM  sodium 
acetate,  10  mM  Tris,  pH7.4,  5 mM  Na2EDTA,  one  unit  of  RNase-One  (Promega 
Corporation)  unless  specified,  and  incubated  at  20C  for  30  minutes.  The  reaction  were 
terminated  by  3.3  pi  of  10%  SDS  and  precipitated  with  0.1  volume  of  8 M ammonium 
acetate  and  2.5  volumes  of  absolute  ethanol.  The  RNA  was  denatured  by  heating  at  70C 
for  10  minutes  before  loading  on  a 6%  polyacrylamide  gel. 

Sizing  of  the  Protected  Fragments 

The  size  of  the  protected  fragment  of  capped  RNA  was  approximated  by 
electrophoresis  on  6%  polyacrylamide  gel,  using  Ml 3 or  mtDNA  genomic  clones  of 
known  sequences  as  markers.  Due  to  the  mass  difference  between  a nucleotide  chain  and 
the  deoxynucleotide  marker  chain,  corrections  were  made  by  calculating  the  actual  mass  of 
the  expected  RNA  product  and  DNA  marker  utilized  in  the  comparison.  The  protected 
fragment  was  thus  approximately  sized,  and  the  assignment  of  precise  5'  termini  was  based 
on  primer  extension  experiments. 
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RNA  Polymerase  Assay 

Purification  of  RNA  Polymerase  from  Maize 

Mitochondria  of  Pioneer  hybrid  maize  were  isolated  as  described  above  for  protein 
purification.  The  transcript  competent  fraction  was  isolated  using  the  procedure  described 
by  Rapp  and  Stem  (1992). 

Mitochondria  were  resuspended  at  4C  in  5 ml  of  mitochondrial  lysis  buffer  of  20 
mM  Tris,  pH8.0, 0.2  mM  Na2EDTA,  1 mM  DTT,  1 mM  Pefabloc  (Boehringer  Mannheim 
Biochemicals)  and  15%  glycerol,  and  transferred  to  a sterile  glass  tissue  homogenizer. 
Sterile  distilled  H2O  (2.12  ml)  was  added,  followed  by  20  stokes  with  the  pestle.  Triton 
X-100  (20%,  0.25  ml)  was  added,  followed  by  20  strokes  with  the  pestle.  Finally,  2.62 
ml  of  4 M KC1  was  added,  followed  by  20  strokes  with  the  pestle.  The  mitochondrial 
lysate  was  placed  on  ice  for  15  minutes  with  vortexing  every  5 minutes.  The  lysate  was 
centrifuged  at  4C  at  32,500  rpm  (Beckman  SW50.1,  100,000  g)  for  1 hour  and  the 
supernatant  was  collected.  Centrifugation  of  the  lysate  at  100,000  g yielded  a membrane- 
free  soluble  protein  fraction. 

An  equal  volume  of  S-100  dilution  buffer  (90  mM  Tris,  pH8.0,  0.1  mM 
Na2EDTA,  0.5  mM  DTT,  2.5%  glycerol)  was  added  to  the  S-100  supernatant,  and  the 
fraction  was  subjected  to  fractional  precipitation  with  20%  ammonium  sulfate  by 
centrifuging  at  12,000  rpm  for  15  minutes  with  Sorvall  SS34  rotor.  The  supernatant  was 
further  subjected  to  20-50%  ammonium  sulfate  precipitation  and  the  protein  was 
precipitated  by  centrifuging  at  12,000  rpm  for  1 hour  with  Sorvall  SS34  rotor.  The  pellet 
was  resuspended  in  2 ml  of  buffer  of  10  mM  Tris,  pH8.0,  1 mM  DTT,  0.1  mM 
Na2EDTA,  7.5%  glycerol,  50  mM  of  KC1,  and  dialyzed  against  two  changes  of  the  same 


buffer. 
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The  dialyzed  20-50%  ammonium  sulfate  cut  was  loaded  onto  a mono-Q  FPLC 
column  (Pharmacia  LKB  Biotechnology)  and  was  eluted  by  step-wise  increases  in  KC1 
concentration  of  0.1,  0.2,  0.3,  and  0.5  M (in  the  above  buffer)  with  a elution  rate  0.5 
ml/minute.  The  protein  eluted  in  0.3  M KC1  was  collected,  combined  if  within  the  same 
peak,  and  dialyzed  in  buffer  A with  10  mM  KC1.  The  dialyzed  protein  was  concentrated  to 
100-150  |xl  using  centricon-10  unit  (Amicon).  The  concentrated  protein  was  aliquoted, 
quickly  frozen  in  liquid  nitrogen,  and  stored  at  -80C.  The  concentration  of  the  proteins 
was  determined  by  the  microassay  of  the  BioRad  measuring  the  absorbance  at  595  nm. 

In  vitro  Transcription  Assay 

In  vitro  transcription  assays  were  performed  in  a 12.5  pi  reaction  of  10  mM  Tris, 
pH7.9,  10  mM  MgCl2,  1 mM  DTT,  0.5  mM  each  ATP,  CTP  and  GTP,  25  pM  UTP,  10 
pCi  [a-32p]UTP  (3000  Ci/mmol),  40  units  of  RNase  inhibitor  (RNAsin,  Promega 
Corporation),  60  ng  of  linearized  template  DNA  or  40  ng  of  insert  DNA,  and  20-40  pg  of 
protein.  The  reaction  was  incubated  at  30C  for  30  minutes,  and  terminated  by  the  addition 
of  37.5  pi  of  stop  solution  containing  4.8  M urea,  0.4  M sodium  acetate,  5.3  mM 
aurintricarboxylic  acid,  26  pg/ml  tRNA  and  0.8%  SDS.  The  mixture  was  immediately 
extracted  twice  with  equal  volume  of  75%  water-saturated  phenol  and  25%  chloroform  and 
once  with  chloroform-isoamylalcohol  (24:1).  The  reaction  products  were  precipitated  with 
0. 1 volume  of  8 M ammonium  acetate  and  2.5  volumes  of  absolute  ethanol,  and  washed 
twice  with  absolute  ethanol.  The  dried  products  were  resuspended  in  80%  formamide,  1 X 
TBE  and  electrophoresed  in  denaturing  polyacrylamide  gels  with  1 X TBE  electrophoresis 
buffer  (0.089  M Tris-borate,  0.089  M boric  acid  and  2 mM  Na2EDTA,  pH8.0)  (Sambrook 
et  al.  1989). 
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Table  1.  Recombinant  DNA  clones  used  in  the  investigation. 
Clone  Vector  Kb  Fragment  Features 


References 


pAFl 

pUC18  6.5 

BamHI 

pJM37 

pBY144 

pUC18  0.217 
pBSSK(-)  0.268 

Xbal/EcoRI 

Hindm 

pBY150 

pBSSK(-)  0.262 

DP19/DP46 

pAF2 

pUC18  5.0 

BamHI 

pBY148 

pBSSK(-)  0.443 

DP31/DP17 

pRS34 

pUC119  11 

BamHI 

pRS35 

pRS61 

pUC119  1.4 
pUC119  6.8 

PstI 

Smal 

pRS62 

pUC119  1.236 

EcoRI 

pRS66 

pRS72 

pBY132 

pUC119  0.785 
pUCl  19  0.457 
pBSSK(-)  0.807 

Pstl/EcoRI 

EcoRI/PstI 

1201/DP48 

pBY136 

pBSSK(-)  0.524 

DP47/121 1 

pHC86 

pUC19  1.223 

BamHI 

pHC103 

pBY145 

pHT130 

pBY124 

pUC19  0.214 
pBSSK(-)  0.214 
pUC19  0.378 
pUC19  2.9 

BamHI/Xbal 

BamHI/Xbal 

Alul/Alul 

Bgin 

pBY128 

pBY142 

pHT98 

pUC19  1.8 
pBSSK(-)  1.8 
pUC19  1.3 

EcoRI/BamHI 

EcoRI/BamHI 

Sphll/Hindlll 

pBY146 

pBSSK(-)  1.0 

BamHI 

pHT70 

pUC19  5.8 

BamHI 

Includes  atp6-l 
IS1112C 

(1) 

Includes  part  of  atp6-l 

(2) 

Includes  part  of  atp6-l 
T3=anti  sense 

this  work 

5'  to  atp6-l 
T3-DP46  proximal 
T3=antisense 

this  work 

Includes  atp6-2 
IS1112C 

(1) 

5'  to  atp6-2 
T3-DP17  proximal 
T3=antisense 

this  work 

Includes  atp9 
Tx398 

(3,4) 

Includes  atp9 

(3,  4) 

Includes  5'  to  & part  of  atp9 
IS1112C 

(3,  4) 

Includes  rmfM  & atp9 
Tx398 

(3,  4) 

Includes  atp9 

(3,  4) 

Includes  fmfM 

(3,  4) 

5'  to  atp9 
T3-1201  proximal 
T3=sense 

this  work 

Includes  atp9 
T3-1211  proximal 
T3=antisense 

this  work 

Includes  orfl07 
IS1112C 

(6) 

5'  to  orfl07 

(6) 

5'  to  orfl07 

this  work 

Includes  part  of  orfl07 

(6) 

Includes  orfl07 
IS1112C 

(6) 

5'  to  orfl07 

this  work 

5'  to  orfl07 

this  work 

Includes  5'  to  & 
part  of  urf209 

(5) 

Includes  5'  to  & 
part  of  urf209 

this  work 

Includes  5'  to  & 
part  of  orf265 

(5) 
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Table  1.  Recombinant  DNA  clones  used  in  the  investigation  (continued). 


Clone 

Vector  Kb 

Fragment 

Features 

References 

pHT75 

pUC19  0.91 

BamHI/Kpnl 

Includes  5'  to  & 
part  of  orf265 

(5) 

pBY133 

pBSSK(-)  0.319 

1211/DP92 

5’  to  orf265 
T3-DP92  proximal 
T3=anti  sense 

this  work 

pHTlOO 

pUC19  0.8 

EcoRI/Kpnl 

5'  to  orf265 

(5) 

pBY149 

pBSSK(-)  0.8 

EcoRI/Kpnl 

5'  to  orf265 

this  work 

1.  Kempken  et  al.  1991. 

2.  Mullen  et  al.  1992. 

3.  Salazar  et  al.  1991. 

4.  Salazar  1991. 

5.  Tang  et  al.  submitted. 

6.  Tang  et  al.  in  preparation. 


41 


Table  2.  Synthetic  oligonucleotides  utilized  in  primer  extension  analyses.  All  the 
oligonucleotides  listed  were  synthesized  by  the  DNA  synthesis  laboratory,  Interdisciplinary 
Center  for  Biotechnology  Research,  University  of  Florida. 


Gene/orf  Oligonucleotide  Sequence 


bp  5'  to  start  codon 
(5'  end  of  the  primer) 


atp6-l 

DP13 

5-GTGACGTGGGTTTTACCCAGCG-3’ 

-52 

DP46 

5-CCACTT  AGAAAT  AT  AAATAAAAAG-3' 

-241 

atp6-2 

DP63 

5'-GGGCGTTGGTGTTTCCAACGA-3' 

-59 

DP49 

5'- AAGAC  AAAATGCT  AGCTTCCC-3 ' 

-116 

atp9 

DP48 

5-CCTTCCTCCTCTCCTTAGTTG-3' 

-68 

DP73 

5-GGTGATCAGGACTCGTACAAG-3' 

-288 

orfl07 

DP  102 

5'-TCCCATTCCAAATACCGGCGG-3' 

+65 

DP  103 

• 5'-ATTTGTCTGGGATTGCGGCC-3' 

-364 

orf265 

DP  127 

5'-GAACACGGACTGCCATTAC-3' 

+296 

DP06 

5-AGACTCGATTCTCCCGTCGAGAG-3' 

+131 

DP131 

5-GTAGGGAGCACACAACTTCTC-3' 

-31 

RESULTS 


The  transcript  patterns  of  the  six  selected  sorghum  mitochondrial  genes  and  open 
reading  frames  (orfs)  utilized  in  this  study  have  been  described  (Mullen  et  al.  1992,  Salazar 
1991,  Tang  et  al.  submitted,  Tang  et  al.  in  preparation).  Thus,  to  map  the  transcription 
initiation  sites  corresponding  to  the  5'  primary  transcript  termini  of  selected  genes,  gene- 
specific  fragments  which  include  5'  flanking  sequence  of  the  gene  were  cloned  and 
sequenced,  and  antisense  RNA  was  synthesized  in  vitro.  Since  5'  termini  of  primary 
transcripts  carry  tri-  or  di-phosphates  (Christianson  et  al.  1983),  while  processed  transcripts 
have  only  monophosphate  or  hydroxyl  groups  at  their  5'  ends,  primary  transcripts  could  be 
labeled  by  capping  at  their  5'  ends  with  the  enzyme  guanylyltransferase.  In  vitro  capping  in 
combination  with  ribonuclease  protection  was  conducted  to  identify  the  5'  termini  of  the 
primary  transcripts.  Total  mitochondrial  RNA  was  prepared  from  mitochondria  purified 
through  Percoll  gradients,  and  used  in  an  in  vitro  capping  reaction  with  a-32p-GTP  and 
guanylyltransferase.  The  capped  RNA  was  hybridized  to  the  antisense  RNA  synthesized 
from  the  gene-specific  clones,  followed  by  ribonuclease  treatment.  The  protected  fragment 
was  analyzed  to  give  an  approximate  size.  To  precisely  map  the  5’  transcript  ends,  primer 
extensions  with  gene  specific  primers  were  performed. 

In  this  study,  the  5'  transcript  termini  of  six  sorghum  mitochondrial  genes  and  open 
reading  frames,  including  two  orfs  which  are  associated  with  CMS  in  sorghum,  have  been 
identified,  and  the  transcription  initiation  sites  have  been  precisely  positioned.  The 
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sequences  surrounding  initiation  sites  were  compared  and  aligned.  The  promoter  of 
sorghum  mitochondrial  atp6-2,  which  is  homologous  to  that  of  maize  atp6  of  mitochondrial 
DNA,  was  tested  in  an  in  vitro  maize  transcription  system,  and  the  transcription  initiated  at  a 
correct  site. 


Transcription  Initiation  of  atp6-l 

Sorghum  mitochondrial  atp6  occurs  as  one  or  two  copies,  depending  on  cytoplasm 
(Mullen  et  al.  1992).  All  sorghum  lines  examined  to  date  carry  a copy  designated  as  atp6-2 
(Mullen  et  al.  1992,  Pring  et  al.  1988, ).  This  form  of  the  gene  consists  of  an  open  reading 
frame  of  1 137  bp,  including  a 756  bp  conserved  core  region  as  those  in  yeast  and  other 
higher  plant  systems,  flanked  5’  by  a 38  i bp  amino  extension  (Kempken  et  al.  1991, 
Mullen  et  al.  1992).  The  mature  ATP6  polypeptide,  from  the  core  region,  probably  results 
from  proteolytic  cleavage,  generating  uniform  amino  termini  (Krishnasamy  et  al.  1994). 

An  additional  copy  of  the  atp6  gene,  atp6-l,  is  found  in  type  members  of  the  Al 
(three  dwarf  white  milo),  A2  (IS12662C)  and  A3  (IS1 1 12C)  groups  of  male-sterile 
cytoplasms  of  sorghum  (Mullen  et  al.  1992,  Pring  et  al.  1988).  This  version  consists  of  an 
open  reading  frame  of  1002  bp,  sharing  an  identical  core  region  of  756  bp  as  that  in  atp6-2, 
and  flanked  5’  by  divergent  extension  of  246  bp  (Mullen  et  al.  1992).  A schematic  diagram 
of  the  core  regions,  amino  extensions  of  both  copies  is  shown  in  Figure  1.  Sequences  5' 
to  atp6-l  and  atp6-2  exhibit  limited  sequence  similarity  for  about  100  bp,  and  then  diverge, 
which  indicates  distinct  5'  transcript  termini  for  the  atp6-l  and  atp6-2  transcripts  (Pring  et 
al.  1992). 

Both  versions  of  atp6  are  transcribed  (Mullen  et  al.  1992),  and  their  transcripts  are 
edited  (Kempken  et  al.  1991).  The  editing  events  include  18  C-to-U  substitutions  within 
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500  bp 


♦DP46  +DP13 

J I 


( atp6-l  ) ~] 


atp6  core  polypeptide,  756  bp 


( aty6-2  ) | 


5'  extensions 
♦DP63 


( plastid  DNA  ) 


Figure  1.  Schematic  diagram  of  sorghum  mitochondrial  atp6-l  and  atp6-2  core 
regions  (dotted  box),  amino  extensions  (striped  box)  and  the  locations  of  the 
primers  (small  black  box)  utilized  in  primer  extension  experiments. 
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the  core  polypeptide  and  one  edit  at  the  carboxy  extension,  altering  15  amino  acids  and 
resulting  in  a carboxyl-terminus  identical  to  that  of  yeast  (Kempken  et  al.  1991).  The 
amino  extension  of  atp6-2  has  one  silent  edit  (Kempken  et  al.  1991),  while  the  amino 
extension  of  atp6-l  exhibits  one  edit  that  changes  one  amino  acid  (Mullen  et  al.  1992). 

Transcription  of  atp6-l  is  characterized  with  a single  transcript  of  1500  nt  (Mullen 
et  al.  1992).  To  map  the  transcription  initiation  site  of  atp6-l,  the  location  of  the  5' 
terminus  of  the  primary  atp6-l  transcript  was  identified  by  ribonuclease  protection  with  a 
gene-specific  fragment. 

Cloning  Gene  Specific  Fragment 

A region  5'  to  atp6-l  was  cloned.  Oligonucleotide  DP19,  5'- 
GCTCATATCATTCTGCGTGC-3',  corresponding  to  a sequence  between  -587  and  -565, 
and  oligonucleotide  DP46  complementary  to  a sequence  between  -343  and  -320  5'  to  the 
start  codon  (Table  2),  were  used  in  polymerase  chain  reaction  with  pAFl  DNA  as  a 
template.  The  amplified  fragments  were  cloned  into  Smal-digested  pBSSK(-).  Sequence 
analysis  revealed  that  clone  pBY150  had  the  DP46  end  adjacent  to  the  T3  promoter,  and 
that  the  T3  RNA  polymerase  initiated  antisense  strand  transcription  of  the  cloned  fragment. 

In  vitro  Capping  and  Ribonuclease  Protection 

In  vitro  capping  was  carried  out  to  identify  the  5'  primary  transcript  end(s)  of  atp6- 
1.  Bluescript  clone  pBY150  DNA  was  used  as  a template  for  synthesis  of  antisense  RNA 
for  ribonuclease  protection  experiments.  Single-stranded  antisense  RNA  was  transcribed 
in  vitro  from  £coRI-digested  pBY150  DNA  with  T3  RNA  polymerase,  and  hybridized  to 
the  capped  RNA,  followed  by  ribonuclease  digestion.  The  protected  fragment  was 
analyzed  as  shown  in  Figure  2.  The  5'  termini  of  the  protected  product  were  assigned  to 


Figure  2.  Ribonuclease  protection  of  the  atp6-l  transcript.  Antisense  RNA  (solid  line)  and 
sense  RNA  synthesized  from  Bluescript  clone  pBY150  were  hybridized  with  capped 
mtRNAs,  respectively,  and  treated  with  ribonuclease.  The  protected  region  (dashed  line) 
was  resolved  with  M13  sequence  as  a marker.  Lane  1:  Capped  RNAs  were  hybridized 
with  antisense  RNA.  Lane  2:  Capped  RNAs  were  hybridized  with  sense  RNA. 
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the  corresponding  DNA  sequence  of  about  -441  and  -437  nucleotide  5'  to  the  start  codon. 
As  a control,  single  stranded  sense  RNA  was  also  synthesized  with  T7  RNA  polymerase 
using  BamHl  digested  pBY150  DNA  as  a template.  As  expected,  no  protected  signal  was 
detected  (Figure  2). 

Primer  Extension 

To  precisely  determine  the  5'  end(s)  of  the  atp6-l  transcript,  oligonucleotide  DP13 
(Table  2),  complementary  to  a sequence  between  -51  to  -72  5'  to  the  start  codon,  was 
prepared.  Mitochondrial  RNA  from  line  IS1 1 12C  was  used  in  a primer  extension  reaction 
with  primer  DP  13,  and  the  extended  product  was  analyzed  as  shown  in  Figure  3 A.  A 
major  product  of  about  392  bp  was  identified,  with  a terminus  corresponding  to  the 
nucleotide  of  -442  5'  to  the  start  codon  (Figure  4). 

Similarly,  another  primer,  DP46  (Table  2),  which  is  270  bp  upstream  of  DP13, 
complementary  to  a sequence  between  -241  to  -264  was  utilized  in  primer  extension 
experiments.  A major  extended  product  of  122  bp  was  detected  (Figure  3B),  with  the  end 
corresponding  to  nucleotide  -442  (Figure  4).  In  addition,  a minor  product  of  1 19  bp  was 
also  detected,  with  the  terminus  corresponding  to  nucleotide  -439  5'  to  the  start  codon. 

Transcription  Initiation  of  atp6-2 

Transcription  of  this  copy  is  characterized  by  a major  transcript  of  1450  nt  (Mullen 
et  al.  1992).  A schematic  diagram  of  atp6-2  is  shown  (Figure  5),  including  locations  of  the 
primers  that  were  used  in  the  following  analyses. 
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A.  ACGT  B.  ACGT 


Figure  3.  Primer  extension  analysis  of  atp6-1.  A:  Oligonucleotide 
DPI  3 was  used  in  dideoxy  sequencing  of  pAFI  and  the  primer 
extension  reaction.  B:  Oligonucleotide  DP46  was  used  in  dideoxy 
sequencing  of  clone  pBY150  and  the  primer  extension  reaction  with 
sorghum  mtRNA. 
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ATTRCm^TATCnv;APCm^(?IYy?TOACGTCCTA(aTAGTTTCGTAGCTCGATTGCTCATATC^TTCTGCgrGCTGCATAG  -561 

DP19o 

TCATAGGAAGCAGATAGTAAGCCTCTCTCATTCAGTAATGCAGATTTGCCAAGTTCTGCrGGACGGCCTACGCGCTCATC  -481 

V V 

♦ ♦ 

CGCTTCGACCTCTACGTCTACGTGAATAAACTTACAATGGAGATCTACTGTGGGAAGGTACGAGCGCTAAAACGACAGTA  -401 

AfvyTyfy'Ty-TAAAPCnAAnAAARATTGATACGCCAAGGCAATAGCAGTGATACCACl'm’rATTTATATTTCTAAGTGG  -321 

ODP46 

TTAGTCGAAGTAGTATTTTAGTAGTATTTTATTTATGTAGTAGAATCATTACCAACGGCAAGTCGATGGGACGCGGAAGT  -241 

OTATCCGTTAGAAAAAATGTAAAGTAGGATTTGGTITTGTCATT'ITAGAGAAGTGAGGATTTAAGTTCTAAATAAATAAA  - 1 6 1 

TTGGCTTTTGATAGGCTTTCTCGGTTCACTAAGTGAATCGAATCATTGTGCAATGTTATGTGTCTAAATCTC^ATTTAGT  -81 

ATTCGTTTCGCTGGGTAAAACCCACGTCACTAAAAGAAAATAAGTCTCCCTTTCTC'ITTTCGGGAGCAGAGCTTAAAAAG  -1 
ODP13 

ATG  +3 


Figure  4.  Nucleotide  sequence  of  the  5'  flanking  region  of  atp6-1.  The  closed  arrows 
represent  the  major  termini  mapped  by  primer  extension  experiments.  The  locations 
of  the  oligonucleotides  are  underlined.  The  open  triangles  represent  the  termini  of 
the  protected  fragments  from  in  vitro  capping  in  conjunction  with  ribonuclease 
protection  experiments. 


Figure  5.  Ribonuclease  protection  of  the  atp6-2  transcript.  Antisense  RNA  (solid  line) 
synthesized  from  Bluescript  clone  pBY148  was  hybridized  with  capped  mtRNA,  and 
treated  with  ribonuclease.  The  protected  region  (dashed  line)  was  analyzed  with  clone 
pBY148  sequence  as  a marker. 
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Cloning  Gene  Specific  Fragment 

A region  5'  to  the  atp6-2  gene,  corresponding  to  a sequence  between 
+105  to  -338  5'  to  the  start  codon,  was  cloned.  Oligonucleotide  DP31,  5'- 
GCATAGATCCGGAGATTCCC-3',  corresponding  to  a sequence  between  -338  and  -317, 
and  oligonucleotide  DPI 7,  5 -CACCGGGGGCATCG-3',  complementary  to  a sequence 
between  +92  and  +105,  relative  to  the  start  codon,  were  prepared  and  used  in  the 
polymerase  chain  reaction  with  pAF2  DNA  (Mullen  et  al.  1992)  as  a template.  The 
amplified  fragments  were  cloned  into  Smal  digested  pBSSK(-).  Sequencing  analysis 
revealed  that  clone  pBY148  had  the  DP  17  end  adjacent  to  the  T3  promoter,  and  that  the  T3 
RNA  polymerase  initiated  antisense  strand  transcription  of  the  cloned  fragment. 

In  vitro  Capping  and  Ribonuclease  Protection: 

Bluescript  clone  pBY148  was  used  as  a template  for  transcribing  antisense  RNA  in 
vitro.  Single  stranded  antisense  RNA  was  synthesized  from  BamHl  digested  pBY148 
DNA  with  T3  RNA  polymerase,  and  hybridized  to  the  capped  RNA,  followed  by 
ribonuclease  digestion.  The  protected  fragment  was  analyzed  (Figure  5).  The  5'  terminus 
of  the  protected  product  was  assigned  to  the  corresponding  DNA  sequence  of  about  -179  5’ 
to  the  start  codon.  Another  minor  9 nt  longer  protected  fragment  was  also  detected,  with  a 
5'  terminus  corresponding  to  a residue  of  -188. 

Primer  Extension  Analysis: 

To  precisely  map  the  5'  end(s)  of  the  atp6-2  transcript,  oligonucleotide  DP63  (Table 
2),  complementary  to  a sequence  of  -59  to  -79,  relative  to  the  start  codon,  was  prepared. 
Mitochondrial  RNA  was  analyzed  with  primer  DP63  in  primer  extension  reactions.  A 
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major  extended  product  of  131  bp  was  identified  (Figure  6A)  with  terminus  corresponding 
to  -188  5'  to  the  start  codon. 

To  verify  the  result  obtained  with  primer  DP63,  another  primer, 

DP49  (Table  2),  complementary  to  a gene  specific  sequence  of  -1 16  to  -136,  was  used  in 
primer  extension  experiment.  A major  extended  product  of  73  bp  was  detected  (Figure 
6B),  with  terminus  corresponding  to  nucleotide  -188,  relative  to  the  start  codon  (Figure  7), 
thus  confirming  the  result  obtained  with  primer  DP63. 

Transcription  Initiation  of  atp9 

Sorghum  mitochondrial  atp9  occurs  as  one  copy  in  normal  line  Tx398,  and  two 
identical  copies  in  some  lines,  such  as  IS  1 1 12C,  A3Tx398  and  A3Tx7000  (Salazar  1991, 
Salazar  et  al.  1991).  The  coding  region  of  three  identical  versions  consists  of  an  open 
reading  frame  of  237  bp  and  would  encode  a 79  amino  acid  polypeptide  (Salazar  et  al. 
1991). 

Transcription  of  sorghum  atp9  is  characterized  by  a single  transcript  of  650  nt  in  all 
the  lines  examined  (Salazar  1991).  A total  of  eight  C-to-U  editing  events  were  revealed 
within  the  coding  region  of  the  copy  in  either  line  Tx398  or  IS  1 1 12C,  changing  six  amino 
acids  and  generating  a new  stop  codon  which  eliminates  1 2 residues  predicted  from  the 
genomic  sequence  (Salazar  et  al.  1991). 

Sequence  analysis  showed  r™fM  is  located  -323  bp  5'  to  all  three  copies  of  atp9 
(Salazar  1991),  which  is  98.7%  identical  to  trafM  of  maize  (Parks  et  al.  1984)  and  wheat 
(Sprinzl  et  al.  1989).  To  identify  the  transcriptional  properties  of  atp9  and  the  transcription 
initiation  site(s),  in  vitro  capping  experiments  were  conducted. 
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A.  A C G T 


B.  A C G T 


Figure  6.  Primer  extension  analysis  of  atp6-2.  A:  Oligonucleotide  DP63 
was  used  for  the  primer  extension  reaction  with  Ml  3 sequence  as  a 
marker.  B:  Oligonucleotide  DP49  was  used  in  dideoxy  sequencing  of 
clone  pBY148  and  the  primer  extension  reaction  with  sorghum  mtRNA. 
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rvKnnn a rrr ifiar.irr.1  anncrnT AnrJkAnraAAgGAAATGCTTCGGGGAGTTGAAATAAGCATAGATCCGGAGAT  -321 

DP31o 


32CCAAATAGGTCAACCTTTTGAACTGCCTGTTGAATCCATGAGCAGGCAAGAGACAACCTAGCGAACTGAAACATCTTA  -241 

V V 

4 

GTAGCCAGAGGAAAATCAAGTATCATGTTGCTCCTCAGAAAACTCGTATAATAGTCTCATTGGCCAAAGTCGATGGGACG  -161 


CGGAAGTGTATGCGTTACTGTACTGGGAAGCTAGCATTTTGTCTTGTCATGGAAGTTCGTGTTCGTGTTCGAATTTG'I'I'T 

<f»DP43 


Bglll 

TTOjTTGGAAACACCAACGCCCACCAAAAAGTCTaiC'TTTCTTTCGGGAGCAGATCTTATAAATCTTACTGTATATAGCr 

<s=DP63 


-81 


-1 


ATG 


+3 


Figure  7.  Nucleotide  sequence  of  the  5'  flanking  region  of  atp6-2.  The  closed  arrow 
represents  the  major  terminus  mapped  by  primer  extension  analysis.  The  locations 
of  the  oligonucleotides  are  underlined.  The  open  triangle  represents  the  terminus  of 
protected  fragment  from  in  vitro  capping  in  conjunction  with  ribonuclease  protection 
experiments. 
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In  vitro  Capping 

Hybridization  of  in  vitro  capped  mRNA  to  a slot  blot  containing  clones  of  the  tmfM 
(Figure  8,  A and  B)  and  atp9  (Figure  8,  A and  C)  is  shown.  In  vitro  capped  RNA 
strongly  hybridized  to  subclones  pRS62  (A)  and  pRS35  (C)  and  only  poor  hybridization 
was  detected  with  subclone  pRS72  (B).  Since  DNAs  were  loaded  in  equal  amounts  and 
only  the  primary  transcripts  were  labeled  (capped  RNA),  the  observation  that  clones  pRS62 
and  pRS35  gave  much  stronger  hybridizations  suggests  that  atp9  and  tmfM  may  be 
transcribed  in  different  transcriptional  units,  or  atp9  may  have  its  own  initiation  region  in 
addition  to  sharing  a precursor  with  tmfM.  The  second  possibility  was  strongly  supported 
by  the  results  of  ribonuclease  protection. 

Cloning  Gene  Specific  Fragments  for  Ribonuclease  Protection 

Oligonucleotide  1201,  5-AACAGCTAGACCATG-3',  corresponding  to  a 
sequence  of  the  vector  of  pRS62,  and  oligonucleotide  DP48  (Table  2),  complementary  to  a 
sequence  between  -68  and  -88  relative  to  the  start  codon,  were  used  in  the  polymerase 
chain  reactions  with  pRS62  DNA  (Salazar  et  al.  1991)  as  a template.  The  amplified 
fragments  were  cloned  into  Smal  digested  pBSSK(-).  Sequencing  revealed  that  clone 
pBY132  had  the  1201  end  adjacent  to  the  T3  promoter,  and  that  the  T3  RNA  polymerase 
initiated  sense  strand  transcription  of  the  cloned  fragment. 

Oligonucleotide  DP47,  5-CAACTAAGGAGAGGAGGAAGG-3',  corresponding 
to  a sequence  between  -88  to  -68,  and  oligonucleotide  1211,5'- 
GTAAAACGACGGCCAGT-3',  complementary  to  a sequence  of  pRS62  vector,  were 
used  in  the  polymerase  chain  reactions  with  pRS62  DNA  as  a template.  The  amplified 
fragments  were  cloned  into  Smal  digested  pBSSK(-).  Sequencing  analysis  revealed  that 
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Figure  8.  Hybridization  of  mtRNA  capped  with  guanylyltransferase  and 

[a-32P]GTP  to  a slot  blot  containing  plasmid  DNA  of  pRS62  (A),  pRS72 
(B),  pRS35  (C)  and  vector  pUC119  (D).  In  (E),  a DNA  blot  containing 
clone  PRS66  was  hybridized  to  kinased  mtRNA  with  polynucleotide 

kinase  and  [y-32P]ATP. 
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the  clone  pBY136  had  the  121 1 end  adjacent  to  the  T3  promoter,  and  that  the  T3  RNA 
polymerase  initiated  antisense  strand  transcription  of  the  cloned  fragment. 

Ribonuclease  Protection 

Single  stranded  antisense  RNAs  were  transcribed  in  vitro  from  clones  pBY132  and 
pBY136,  and  hybridized  with  capped  mtRNA,  respectively,  followed  by  ribonuclease 
digestion.  The  protected  fragments  were  analyzed.  Antisense  RNA  derived  from  clone 
pBY132  was  used  in  the  protection  experiment  (Figure  9),  and  the  major  protected 
fragments  were  sized  as  241,  238  and  234  nt,  with  termini  corresponding  DNA  sequence 
of  -309,  -305  and  -302,  relative  to  the  start  codon,  which  are  located  3'  to  the  3'  terminus 
of  tmfM.  This  suggests  that  there  is  an  initiation  region  3'  to  tmfM  that  promotes 
transcription  of  atp9.  Antisense  RNA  transcribed  the  downstream  fragment  pBY136  was 
also  used  in  the  protection  experiment,  and  no  signal  was  detected  (not  shown),  suggesting 
that  the  pBY136  region  lacked  initiation  termini.  As  controls,  capped  RNA  with  and 
without  ribonuclease  digestion  were  also  analyzed. 

Primer  Extension  Analysis 

To  precisely  localize  the  5'  termini  of  the  atp9  transcript,  oligonucleotide  DP48 
(Table  2),  complementary  to  gene-specific  sequences  of  atp9,  was  used  in  primer 
extension  analyses  with  total  mtRNA  from  line  Tx398.  Major  extension  products 
corresponding  to  -320,  -319,  -318,  -314,  -313,  -312,  -310,  -306,  -302,  -296,  -295,  -291, 
-289,  -279,  -277,  -275,  -273,  and  -271,  5’  to  the  start  codon,  were  detected  (Figure  10). 
The  longest  extended  fragment  ended  at  -322,  the  3'  terminus  of  tmfM  (Figure  1 1).  This 
pattern  of  multiple  extension  products  was  reproducible,  and  the  RNAs  used  in  these 
assays  gave  discrete  termini  in  primer  extension  experiments  with  other  genes.  This 


Figure  9.  Ribonuclease  protection  of  the  atp9  transcript.  Antisense  RNA  (solid  bar) 
synthesized  from  Bluescript  clone  pBY132  was  hybridized  with  capped  mtRNA,  and 
treated  with  ribonuclease.  The  protected  region  (dashed  line)  was  analyzed  with  M13 
sequence  as  a marker. 
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Figure  10.  Primer  extension  analysis  of  atp9.  Oligonucleotide  DP48  was 
used  in  dideoxy  sequencing  of  pBY132  and  primer  extension  reaction  with 
sorghum  mtRNA. 
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EcoRI 

SMIICGGAGAGGTCAGATGAGATAGATGGCrK?rGTTCAAAATGGAATGATCCX:CITTGATTGAATTTCTGTTGTCCACA  -721 
ACGAATAAAGAGTCOSAAAAGACGTACGCATAACAGACTCGOTACCATGGTCATAGGGTTCAGTGGAGAATTGCGrrATAC  -641 
AAGGAGCAAGAAACCTATGCX3CTTTAGCAACAAAGCGCTCATCCCTTGTTCTrCGCrrGGAAGCTCAATCCCCTCGCTTC  -561 

SacI 

TTCTTTCGCTTCTACGCTCTTGCTTTCAGCAAGTTCGTACXAGTCGTCAGTCAAGGAGGGAGCTCCAACCTCCGCGCCTA  -481 
GCCGCAACCTAAACGCTGGTTCTATCCCGGCCAAGCAAGCAAGGGGAACCGGTGGGAAGAGGGAAAGAAAGATCAGGGGA  -401 

Petl 

KGhAGCGGGGTAGAGGAATTGGTCGACTCATCAGGCTCATGACCTGAAAGACTGCAGGrrTCGAATCCTGTCCCCGCClRtL  -321 
V V V 

******  * 4 4*44  44444 

TCACGTCAGATAigrrGTACGAGTCCTGATCACCTGGGTCAAAGAATCTATGTTCAAgrCGCCTGCCTAACAAATAGACGA  -241 

<•  DP73 

CTGCGTACTGTAGTTAGTC7rG7TCCGTTGGTCa3ATCCTACCATGGAAAGTGTTGCTGT(TKyvAATCGAGATTGTGTGGG  -161 

TGTTTAGG7TTCTTA7*reCACTATGTGGGfrrraAGGTTTreXIAGTTTCATTGAAGCAACTCCTTTCXX7ITTTCAACTAAG  -81 

GAGAGGAGGAAGGAATCAAATCAACGAGAAATACTATACTCGAAATTTCGAGTAGCGAAGGAAAAGCGCGAAACAATGTC  -1 
o DP48 

ATG  +3 


Figure  1 1 . Nucleotide  sequence  of  the  5'  flanking  region  of  atp9.  The  trrriM  sequence  is 
shown  in  italics.  The  closed  arrows  represent  the  major  termini  mapped  by  primer 
extension  analysis.  The  locations  of  the  oligonucleotides  are  underlined.  The  open 
triangles  represent  the  termini  of  protected  fragments  from  in  vitro  capping  in 
conjunction  with  ribonuclease  protection  experiments. 
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suggests  that  atp9  was  co-transcribed  with  tmfM  and  the  post  transcriptional  processing 
event(s)  generated  heterogeneous  transcript  termini  of  atp9.  In  addition,  atp9  possessed  an 
initiation  region  that  initiated  transcription  at  the  positions  -309,  -305  and  -302  nt  (Figure 
11). 

Another  primer,  DP73  (Table  2),  which  is  located  220  bp  upstream  of  DP48  and  35 
bp  3'  to  the  3'  end  of  tmfM.,  was  used  to  detect  other  primary  transcripts.  No  extended 
fragments  could  be  detected.  The  primary  transcripts  may  be  processed  too  rapidly  to  be 
detected  by  this  assay. 

Phosphorylation  Reaction 

The  T4  polynucleotide  kinase  exchange  reaction  (Sambrook  et  al.  1989)  has  been 
reported  to  label  only  processed  mtRNA,  not  initiated  transcripts  (Mulligan  et  al.  1988a, 
Auchincloss  and  Brown  1989),  and  this  enzyme  was  used  in  this  study.  The  termini  other 
than  at  -309,  -305,  and  -302  presumably  resulted  from  processing  events,  and  could  be 
recognized  by  the  T4  polynucleotide  kinase.  Hybridization  of  5'  phosphorylated  mtRNA 
with  a Southern  blot  containing  subclone  pRS66  DNA  (Salazar  1991)  is  shown  ( Pstl - 
EcoRI,  Figure  8E).  Phosphorylated  mtRNA  hybridized  with  pRS66,  suggesting  that 
pRS66  includes  5'  termini  recognized  by  polynucleotide  kinase. 

Analysis  of  fmfM 

To  determine  if  there  were  other  copies  of  tmfM  located  in  the  sorghum 
mitochondrial  genome,  a Southern  blot  containing  restricted  total  mitochondrial  DNA  was 
probed  with  DNA  fragments  covering  part  (pRS72,  Salazar  1991)  or  all  of  tmfM  (Figure 
12  B).  MtDNA  from  line  Tx398  was  restricted  with  Pstl,  Smal  and  Xbal,  respectively, 
and  probed  with  an  EcoRl-Pstl  fragment  of  clone  pRS72,  which  includes  the  5’  flanking 


A.  XbaH  Small  Pstt 


B. 


Xba\  Smal  Pstl 


Figure  12.  Hybridization  of  restricted  mtDNA  (Xba\,  Smal  and  Pstl)  with 
pRS72  (A)  and  a fragment  (B)  covering  the  region  from  -195  to  -496  5' 
to  the  atp9  start  codon  (Figure  11).  pRS72  covers  the  region  from 
residue  -344  to  -799,  relative  to  the  atp9  start  codon  (Figure  1 1 ).  A 
single  hybridization  band  was  detected  in  A.  In  B,  two  hybridization 
bands  were  detected  when  the  mtDNA  was  restricted  with  Pst I,  since 
the  probe  includes  a Pstl  site  (Figure  11). 
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and  part  of  the  coding  sequence  of  fmfM,  and  probed  with  a fragment  covering  the  coding 
sequence  of  trrcfM.  Hybridization  showed  only  one  band  in  each  lane,  suggesting  that 
rrafM,  located  5'  to  atp9,  is  the  only  copy  of  this  gene  in  the  genome  of  Tx398.  Two 
hybridization  signals  were  detected  in  Figure  12  B when  mtDNA  was  restricted  with  PstI, 
as  the  probe  covers  a Pstl  site. 

To  further  determine  if  the  tmfM  is  transcribed,  a DNA  blot  containing  pRS72 
DNA  was  hybridized  to  labeled  small  molecular  weight  RNAs  (Figure  13).  The  small 
molecular  weight  RNAs  were  obtained  by  precipitation  of  LiCl  soluble  RNAs  with  ethanol, 
treated  with  DNAse  and  alkaline  phosphatase,  followed  by  polynucleotide  kinase  labeling. 
The  Sacl-Pstl  fragment  of  pRS72  (Figure  13)  hybridized  to  the  labeled  probe, 
demonstrating  that  tmfM.  is  transcribed.  Hybridization  of  the  EcofU-Sacl  fragment  (Figure 
13)  may  result  from  transcription  of  the  £cc>RI-5acI  region. 

Transcription  Initiation  of  orf!07 

Orfl07  is  a 321  bp  chimeric  open  reading  frame  of  the  A3  male  sterile  cytoplasm. 
The  chimeric  configuration  involves  310  bp  93%  similar  to  sequences  5'  and  internal  to 
atp9,  including  3 1 amino  terminal  residues  84%  similar  to  ATP9  (Tang  et  al.  in 
preparation).  Sequences  3'  to  the  atp9  similarity  are  of  unknown  origin.  The  carboxy 
terminal  54  residues  are  52%  identical  to  the  carboxy  terminus  of  orf79,  an  open  reading 
frame  associated  with  CMS  in  rice  (Akagi  et  al.  1994).  A total  of  four  editing  events  were 
observed  for  orfl07  transcripts,  altering  four  amino  acids,  including  one  within  the  atp9 
similarity  region  (Tang  et  al.  in  preparation). 

Transcripts  of  orfl07  include  species  of  about  1 100,  870,  810  and  380  nt.  The 
1 100,  870  and  810  nt  transcripts  are  major  transcripts  in  A3Tx398,  while  in  line  IS1 1 12C, 
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Figure  13.  Hybridization  of  pRS72  DNA  with  [y-32P]ATP  labeled  tRNA. 
Sad  and  Pst\  digested  pRS72  DNA  was  electrophoresed  in  a 1%  agarose 
gel,  and  blotted  to  nitrocellulose.  The  DNA  blot  was  probed  with  labeled 
tRNA  with  polynucleotide  kinase  and  [y-32P]ATP. 
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a transcriptional  processing  event  dramatically  reduces  the  abundance  of  the  1100,  870  and 
810  nt  transcripts,  and  appearance  of  high  levels  of  the  380  nt  transcript. 

Cloning  Fragments  5'  to  orfl07  and  Gene  Specific  Clones 

To  clone  a fragment  5'  to  clone  pHC86  (Table  1),  which  contains  orfl07  and  the  5' 
flanking  sequence  up  to  -485  (Tang  et  al.  in  preparation),  cosmid  clone  8G8  (Salazar  1991) 
DNA  was  digested  with  BglFL  and  probed  with  pHC86.  A 2.9  kb  Bgl II  fragment  was 
identified,  cloned  into  BglU  digested  pUC19  vector  (pBY124)  and  sequenced.  This  clone, 
pBY124,  was  used  for  subcloning  a 1.8  kb  EcoRl-BamHl  fragment  into  pUC19  (pBY128) 
as  well  as  the  Bluescript  vector  (pBY142)  (Figure  14).  Clone  pBY142  is  located  5'  to 
clone  pHC86. 

Another  DNA  segment,  the  214  bp  Xbal-BamHl  fragment  of  pHC103,  was  also 
cloned  into  the  Bluescript  vector  (pBY145,  Figure  14)  and  sequenced. 

In  vitro  Capping  and  Ribonuclease  Protection 

Single-stranded  antisense  RNA  was  synthesized  from  £coRI-digested  pBY142 
with  T3  RNA  polymerase,  and  from  YZ?aI-digested  pBY145  with  T7  RNA  polymerase, 
respectively.  Hybridization  of  the  capped  RNA  to  the  antisense  RNA  and  ribonuclease 
digestion  of  the  duplex  resolved  several  protected  fragments  (Figure  14).  In  Figure  14 A, 
antisense  RNA  used  for  protection  was  transcribed  from  pBY142  and  two  major  protected 
fragments  were  detected,  with  termini  corresponding  to  nucleotides  of  about  -537  and  -533 
5'  to  the  start  codon.  In  Figure  14B,  antisense  RNA  was  transcribed  from  pBY145,  and 
two  protected  regions  were  identified,  with  protected  termini  corresponding  to  -352,  -348, 
-292  and  -288  5'  to  the  start  codon. 


Figure  14.  Ribonuclease  protection  of  the  orfl07  transcript.  Antisense  RNAs  (solid  bar) 
synthesized  from  Bluescript  clones  pBY142  (riboprobe  A)  and  pBY135  (riboprobe  B) 
were  hybridized  with  capped  mtRNAs,  and  treated  with  ribonuclease.  The  protected 
regions  (dashed  line)  were  resolved  with  M13  sequence  as  a marker. 
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Primer  Extension 

To  identify  precisely  the  5'  transcript  termini  of  orfl07,  primer  extension 
experiments  were  conducted.  Oligonucleotide  DP  102  (Table  2),  complementary  to  a 
sequence  between  +65  and  +85,  and  is  located  3'  to  the  atp9  similarity  region,  was 
prepared  and  used  in  primer  extension  with  total  mtRNA  from  line  A3Tx398.  Several 
major  products  were  detected,  with  termini  mapped  to  residues  of  -233,  -237,  -268,  -292, 
-296,  and  -358  (Figure  15A),  5'  to  orfl07  (Figure  16).  In  addition,  a longer  fragment 
corresponding  to  the  nucleotide  of  about  -539  was  also  detected  (not  shown) 

Another  primer,  DP103  (Table  2),  complementary  to  a sequence  of  -364  to  -383, 
relative  to  the  start  codon,  and  460  bp  5'  to  DP  102,  was  used  to  verify  the  site  of  the  distal 
terminus  obtained  by  primer  DP102.  The  extended  fragment  was  analyzed  (Figure  15B). 

A major  terminus  was  detected  corresponding  to  residue  of  -539,  upstream  of  the 
translational  start  codon  (Figure  16). 

Transcription  Initiation  of  urf209 

An  open  reading  frame  coding  for  25  kD  gene  product,  "orf25",  has  been  identified 
in  many  plant  specifies,  including  tobacco  (Stamper  et  al.  1987),  maize  (Dewey  et  al.  1986, 
Stamper  et  al.  1987),  wheat  (Bonen  et  al.  1990),  rice  (Liu  et  al.  1992),  Marchantia  (Oda  et 
al.  1992)  and  Arabidopsis  (Brandt  et  al.  1992).  The  function  of  the  gene  product  is 
unknown,  and  the  open  reading  frame  is  variable  in  size  in  different  species.  In  sorghum, 
or/25  is  designated  as  urf209,  predicting  209  amino  acid  residues  (Tang  et  al.  submitted). 

A total  of  10  C-to-U  editing  events  were  revealed,  altering  nine  amino  acids.  Transcription 
analysis  showed  that  there  are  two  major  transcripts  of  1050  and  830  nt  in  IS  1 1 12C  (Tang 
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Figure  15.  Primer  extension  analysis  of  orf107.  A: 
Oligonucleotide  DPI  02  was  used  in  dideoxy  sequencing  of 
clone  pBY142  and  the  primer  extension  reaction  with  sorghum 
mtRNA.  B:  Oligonucleotide  DPI 03  was  used  in  dideoxy 
sequencing  of  clone  pBY142  and  the  primer  extension 
reaction. 
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v v 

♦ 

ATCTATTATTCAGATTATCTTAAATAATGGTCTGAGTAGTTTTAITTTCTTCAATCWrcGGCGGAGTIVn'TGAAGCACC  -481 


BamHI 

TTACCCTGGATCAACrcACXX3TCTTCAATCTTATGTATCTGCTGAGAACTGAGGAAAGCAGTCGAAGAAGCCATCGGATC  -401 

V V 

4 

CACTTAGGCGCGCGGACGGCCGCAATCCCAGACAAATATATTAATAAAGCATCCTrATTGCTGACCAiCCTTGTGATCAAA  -321 

ODP103  V V 

4 4 4 

TTGGATCAGGCAAAAAAATCAATAGTAGATAATGAAAGAAGAGTCAGCCCTTGGGGCACTAGGTGGGTCCCCCGTATGCA  -241 


4 4 Xbal 

GGTCGCTACrrAACTTGACAAGCAGTGGTGAAAGTGTTGCIH3TCTCAAATCTAGAriK3TGTGGGTGTTTAGGTTTCTTAT  -161 

TCCACTATGTGGGTTTTAGGTTTTCCAGTTTCATTGAAGCAA<^CrrCGCITTCATCTGTTCCACTAAGGAAAGGAAAG  -81 

GAGAGGAGGAAGGAATCAAATCAACGAGAAATACTATACTCGAAATTTCGAGTAGCGAAGGAAAAGCGCGAAACAATGTC  -1 

ATGTCGCGACTCTACATGTTAGAAGGAGCTAAATCAATAGGTGCTATAGCTGTAGTTTCCGCTGTAGCGGGAGCTTCCGC  +80 

CGGTATTTGGAATGGGATGAAAAGAAAGGAAACTTCCCCATTGCATCAATTCATGCTATTTGCCGCTTGTCCCATCATAG  +160 
o DP102 


Figure  16.  Nucleotide  sequence  of  the  5'  flanking  region  of  orf107.  The  closed 
arrows  represent  the  major  termini  mapped  by  primer  extension  analysis.  The 
locations  of  the  oligonucleotides  are  underlined.  The  open  triangles  represent  the 
termini  of  protected  fragments  from  in  vitro  capping  in  conjunction  with  ribonuclease 
protection  experiments. 
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et  al.  submitted),  and  primer  extension  experiments  identified  the  5'  termini  of  the  urf209 
transcripts  to  residues  of  -345  and  -134,  5'  to  the  start  codon. 

Cloning  Gene  Specific  Fragments 

A gene-specific  fragment  covering  the  termini  identified  by  primer  extension, 
including  5'  flanking  and  part  of  the  coding  sequences  of  urf209,  was  prepared  from  clone 
pHT98  digested  with  BglU  and  BamHl,  and  cloned  into  BamHI-restricted  Bluescript 
vector.  Sequencing  revealed  that  clone  pBY146  had  the  BglU  end  adjacent  to  the  T3 
promoter,  and  that  the  T3  RNA  polymerase  initiated  antisense  transcription  of  the  cloned 
fragment. 

In  vitro  Capping  and  Ribonuclease  Protection 

To  identify  if  the  termini  determined  by  primer  extension  are  primary  ends,  single 
stranded  antisense  RNA  was  synthesized  from  BamHI-digested  pBY146  with  T3  RNA 
polymerase,  and  hybridized  to  the  capped  RNA  from  IS  1 1 12C.  The  protected  fragment 
was  analyzed  (Figure  17).  A major  terminus  was  detected,  corresponding  to  nucleotide  of 
about  -331  (Figure  18). 


Transcription  Initiation  of  orf265 

Orf265  has  been  described  as  a chimeric  gene  specific  to  the  A3  male-sterile  sorghum 
cytoplasm,  and  resulted  from  a series  of  recombinational  events  (Tang  et  al.,  submitted). 
The  5'  region  of  the  open  reading  frame  includes  a 288  bp  sequence  95%  identical  to  the  3’ 
terminal  region  of  a repeat  present  5'  to  maize  T cytoplasm  atp6  and  T-urfl3  (Dewey  et  al. 
1986),  including  a common  transcription  initiation  region  for  the  two  genes  in  T cytoplasm 
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Figure  17.  Ribonuclease  protection  of  the  urf209  transcript.  Antisense 
RNA  (solid  line)  synthesized  from  Bluescript  clone  pBY146  was 
hybridized  with  capped  mtRNA,  and  treated  with  ribonuclease.  The 
protected  region  (dashed  line)  was  analyzed  with  Ml 3 sequence  as  a 
marker. 
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♦ v 

CCATTTTGAAAGAACCAAAAACCCTACTAGATGGTATTrGGAGAATATCCTAATAAGTTTAGCTTATTrCCTrGCTTTCT  -321 

GAAAATACCCTATTTTACATGGATTGACCTGCTAACCTACCTCTACTGGTTTCTAATTKXIATATCTTTAGGAAAGACCG  -241 

GATCAGAACATGAATTTCTGGGCTTACITAGAAGATGTIV3ATCGAAGTACTCTATATCCCTCTrATGGGGCTAAGCCCGC  -161 

ODP91 

TATCAGGCTGTGAQOTQGAATCAATATTTTCAA'rrGTATCTAATAGAATTTA'ITTrGAAGGCTTTCCTTGQAAAAACCTAC  -81 
GCCX^CCAAAAAGTCTO3CTTTCTCTTTCrCTTTTCGGGAGCAGACXriTCX^AAATCCAATCAAATa^TAGTTACTC?r  -1 
ATGATGATGACTAGATTGAGTTCAA(XK3ATATGAAGGATATAAATATGCTATTTGCTGCTATTCCATCTATTTGTGCATC  +80 
Bglll 

AAGTCCGAAGAAGATCTCAATCrATAATGAAGAAATGATAGTAGOTCGTTGTTTTATAGGCTTTCTCATATTCAGTCGGA  +160 


Figure  18.  Nucleotide  sequence  of  the  5'  flanking  region  of  urf209.  The  major 
terminus  mapped  by  in  vitro  capping  and  ribonuclease  protection  is  indicated  by  the 
open  triangle.  The  closed  arrow  represents  the  terminus  determined  by  primer 
extension  (Tang  et  al.  submitted). 
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maize  (Kennell  and  Pring,  1989).  Within  the  maize-like  sequence,  an  in-frame  start  codon 
is  present  followed  by  53  bp  of  maize-like  sequence  and  189  bp  of  sorghum  urf209  . The 
insert  corresponds  to  codons  51-1 14  of  urf209.  Sequences  of  106  bp  3'  to  the  urf209 
similarity  region  are  identical  to  the  sequence  5'  to  sorghum  atp6-2,  followed  by  sequences 
of  unknown  origin.  An  open  reading  frame  of  795  bp  was  deduced  in  this  configuration, 
encoding  265  amino  acids.  An  open  reading  frame  coding  for  130  residues  can  be  deduced 
if  a second  in-frame  start  codon  was  used.  A schematic  diagram  of  the  orf265  chimeric 
structure  is  shown  in  Figure  19.  The  chimeric  orf265  exhibits  four  base  pair  changes 
within  the  urf209  similarity  region,  changing  three  predicted  amino  acids.  Two  transcripts 
of  about  3500  and  3100  nt  were  detected  by  Northern  analysis. 

Cloning  Gene  Specific  Fragments 

To  map  the  5'  transcript  termini  of  orf265,  the  5'  flanking  region  of  orf265  was 
cloned.  A 310  bp  fragment,  which  is  between  -13  and  -330,  was  amplified  by  PCR  with 
primers  DP92,  5'-GAGAGGCTACCAGGACACG-3'  and  the  universal  primer  1201, 
using  pHT75  DNA  (Tang  et  al.  submitted)  as  a template.  The  amplified  fragments  were 
cloned  into  the  Bluescript  vector  and  sequencing  revealed  that  clone  pBY133  had  the  DP92 
end  adjacent  to  the  T3  promoter,  and  that  the  T3  RNA  polymerase  initiated  antisense 
transcription  of  the  cloned  fragment. 

Another  fragment  which  covered  the  region  between  -330  bp  to  about  -1.1  kb  from 
pHTlOO  (£coRI-£coRI)  was  cloned  into  the  Bluescript  vector,  and  sequencing  revealed 
that  clone  pBY149  had  the  distal  EcoRl  end  adjacent  to  the  T3  promoter  and  that  the  T3 
RNA  polymerase  initiated  antisense  transcription  of  the  cloned  fragment. 


Figure  19.  Ribonuclease  protection  of  the  orf265  transcript.  Antisense  RNAs  (shown  in 
solid  bar)  synthesized  from  Bluescript  clones  pBY149  (riboprobe  A)  and  pBY133 
(riboprobe  B)  were  hybridized  with  capped  mtRNA,  and  treated  with  ribonuclease.  The 
protected  regions  (dashed  line)  were  analyzed  with  M13  sequence  as  a marker  in  (A)  and 
with  clone  pBY148  sequence  as  a marker  in  (B). 
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Mapping  Transcription  Initiation  Sites  of  orf265 

Total  mtRNA  from  line  IS1 1 12C  was  capped  and  hybridized  with  antisense  RNAs 
synthesized  from  pBY149  and  pBY133,  respectively.  The  duplex  was  treated  with 
ribonuclease,  and  the  protected  regions  were  analyzed.  A terminus  corresponding  to  a 
residue  of  about  -547  (Figure  19A)  was  detected  with  the  antisense  RNA  derived  from 
clone  pBY149.  Using  antisense  RNA  synthesized  from  clone  pBY133,  two  major 
protected  fragments  were  identified,  with  termini  corresponding  to  residues  of -1 16  and 
-113  (Figure  19B)  5'  to  the  start  codon. 

Primer  Extension 

To  precisely  determine  5'  termini  of  the  orf265  transcripts,  primer  extension 
analysis  was  conducted.  Oligonucleotide  DP  127  (Table  2),  which  is  complementary  to  a 
sequence  of  +296  to  +278  5'  to  the  start  codon,  and  located  3'  to  the  urf209  similarity 
region,  was  prepared  for  primer  extension  analysis.  A major  terminus  corresponding  to 
residue  -116  was  identified  (Figure  20A).  In  addition,  a longer  product  was  also  detected, 
with  terminus  corresponding  to  the  nucleotide  of  -554  (not  shown). 

Primer  DP06  (Table  2),  which  is  complementary  to  a sequence  of +131  to  +1 12 
and  located  within  the  urf209  similarity  region  (Figure  19),  was  also  used  in  primer 
extension  analysis  and  gave  a result  consistent  with  DP  127  (Fig.  20B).  Since  primer  DP06 
is  located  within  urf209  similarity  region,  it  may  detect  the  termini  of  orf265  transcripts  as 
well  as  urf209  transcripts.  Another  oligonucleotide  DP131  (Table  2),  complementary  to  a 
sequence  of  -31  to  -51,  relative  to  the  start  codon,  and  is  located  upstream  of  the  urf209 
similarity  region  (Figure  19),  was  used  to  map  the  longer  transcript  terminus.  A major 
terminus  was  detected  corresponding  to  the  nucleotide  of  -554  upstream  of  the  orf265  start 
codon  (Figure  20C,  Figure  21). 
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Figure  20.  Primer  extension  analysis  of  orf265.  A:  Oligonucleotide 
DPI  27  was  used  for  the  primer  extension  reaction  with  clone  pBY146 
sequence  as  a marker.  B:  Oligonucleotide  DP06  was  used  in  primer 
extension  with  Ml 3 sequence  as  a marker.  C:  Oligonucleotide  DPI 31 
was  in  primer  extension  with  the  C track  of  Ml 3 sequence  as  a marker. 
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v 

TAACTCGTACATAACATAAAAAGTGATGTTCACTAAGTTAATAGATCGAACGAACrci*IGTGTTGCGAGCGAAAACCAAG 

a rrrwyyva'  • i^’TTr^ftrtAr^ty^AAGAAGAGAAG'ITCTGTGCACCCTACAAAGCCCu-i-rAGACCCACTCATGATAGATCA 

o DP131 


Kpnl 


CCTTACCTTCGAAGGCAGCCAAACATGCCTTCTCG'ITTCACTAAACAGGCCTCTCAACCCG'riTTGACGGTGGTACCCAT 


TCTGCTAGTGCGAAACGrTGTTCCCCGTTTCTTC»TGATCCGATCCGGTAT<^GCGGATACTTGCATTTTACCGGGAAGGG 

TTGAAACAATGGAATGCAACGCAACAAGAGCGTATGTTAAGTTCTATAGriK3ATG?IX3ATGCGGCTCGATTCATCATTGCT 

V V 

♦ 

TCCACAAAAAATCAATGTCCXn'TITGACTCTAAAATTATCATAGAGAAAGATGTTCTGA'rrCAG'rrCTCTCTGAAAAGAA 

GACGGGGCCCTTAGGGACACACTAGTACCA'ITTCCATTGTGCGAAAGGTCGIKjTCCTGCTAGCCTCrcTGCTGTAGTAAG 

ATGAACXX7TCTCATTGCATCAAACAGTCAATTCAACCAu-rl%j'ljrAAAl“I\jl<AAATCTATATTGTGTQG<ji,vJiJl-CAGTCG 


-481 

-401 

-321 

-241 

-161 


-81 

-1 

+80 


fia  AGAPTTTCJlAAGAAACTOTCGACGGGAGAATCGAGTCrrATTCAGGAAGAATTGCAGC^TTCTTCA  +160 

o DP06 


ATCCTAACGAAGTCAITCCGGAGGAATCCAATGAACAACAACGGTTACTTAGAATCAGCITGCGAATTTGCAGCACCGTA  +240 

r:Tar:aAPraTT&rrArr’f^JAC!TA(^TTTGGTTTTQTAATGGCAGTCCGTGTTCGTGTTCGTGTTa3TGTTCGAATTT  +320 

o DP127 


Figure  21 . Nucleotide  sequence  of  orf265  and  the  51  flanking  region.  The  closed 
arrows  represent  the  major  termini  mapped  by  primer  extension  analysis.  The 
locations  of  the  oligonucleotides  are  underlined.  The  open  triangles  represent  the 
termini  of  protected  fragments  from  in  vitro  capping  in  conjunction  with  ribonuclease 
protection  experiment. 
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In  vitro  Transcription  of  atp6-2  with  Maize  Extract 

Plasmid  pBY148  insert  (EcoRI-Bg/II)  covering  a region  from  -38  to  -338  (Figure 
7)  5'  to  the  atp6-2  start  codon  was  programmed  with  maize  transcriptional  extract.  A 
transcript  of  160  nt  was  detected  (Figure  22),  corresponding  to  the  initiation  at  the  -188 
nucleotide  (Figure  6)  and  the  termination  at  the  end  of  the  template  ( BglU  site), 
demonstrating  that  the  sequence  surrounding  -188  nt  was  specifically  recognized  by  RNA 
polymerase.  Thus  the  DNA  element  around  -188  is  referred  to  as  a promoter. 


Figure  22.  In  vitro  run-off  transcript  analysis.  Clone  pBY148  was  digested  with  £coRI 
and  Bglll  to  release  the  insert,  and  the  insert  was  used  as  a template  in  an  in  vitro 
transcription  reaction  with  maize  extract.  Products  were  analyzed  with  M13  sequence  as  a 
marker. 
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DISCUSSION 


Transcript  patterns  of  plant  mitochondrial  genes  are  usually  complex.  Frequent 
recombinations  and  rearrangements  may  shuffle  a promoter  region  and  result  in  chimeric 
sequences  or  loss  of  promoter  function  (Gray  et  al.  1992).  Initiation  of  transcription  may 
occur  at  one  or  several  promoters  for  one  gene  (Gray  et  al.  1992).  Furthermore,  extensive 
transcriptional  processing  may  generate  heterogeneous  5'  transcript  termini  that  can 
superimpose  a further  complexity  to  the  transcript  patterns  of  plant  mitochondrial  genes. 
Alignments  of  sequences  surrounding  transcription  initiation  sites  of  plant  mtDNA  revealed 
a limited  conserved  YRTA  motif  (Rapp  and  Stem  1992).  The  YRTA  block  has  been  tested 
in  an  in  vitro  transcription  system  and  shown  to  be  an  essential  element  for  the  promoter 
function  of  maize  atpl  (Rapp  and  Stem  1992,  Rapp  et  al.  1993).  However,  DNA 
sequences  around  transcription  initiation  sites  for  some  plant  mitochondrial  transcripts 
show  little  similarity  to  the  YRTA  element  (Binder  et  al.  1994,  Lizama  et  al.  1994, 

Mulligan  et  al.  1991).  Taken  together,  these  characteristics  may  indicate  that  the  plant 
mitochondrial  transcription  complex  may  not  have  a stringent  sequence  recognition 
requirement. 

To  provide  more  information  about  plant  mitochondrial  transcription  initiation,  we 
investigated  5'  transcript  termini  of  Sorghum  bicolor.  The  objectives  of  the  study  include 
characterization  of  transcription  initiation  of  selected  sorghum  mitochondrial  genes  and  open 
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reading  frames  as  a preface  to  elucidate  mitochondrial  gene  regulation  at  the  transcriptional 
level.  The  availability  of  basic  transcriptional  data  and  cloned  5'  sequences  allowed  the 
examination  of  nine  transcription  initiation  regions  for  four  genes  and  two  open  reading 
frames,  which  should  give  a good  representation  of  transcriptional  initiation  in  sorghum 
mtDNA. 

The  nine  initiation  regions  identified  in  this  study  by  ribonuclease  protection  are, 
generally,  in  good  agreement  with  the  data  by  primer  extension  experiments.  This 
concordance  are  interpreted  as  evidence  that  the  number  and  the  position  of  the 
transcription  initiation  sites  identified  through  ribonuclease  protection  experiments  are 
consistent  with  the  results  obtained  by  primer  extension  analyses.  Assignments  of  the  5' 
termini  of  the  protected  RNA  fragments  to  the  corresponding  DNA  sequence  were  based 
on  sizing  of  the  protected  fragments.  The  sizing  of  protected  RNA  was  approximated  by 
electrophoresis  using  M13  or  mtDNA  genomic  clones  as  markers,  taking  the  RNA  and 
DNA  mass  differences  into  account.  Several  protected  fragments  were  sized  somewhat 
shorter  than  predicted  by  primer  extension  data.  Instability  at  the  3'  terminus  of  the  DNA- 
RNA  heteroduplex  due  to  specific  nucleotide  sequences,  which  could  allow  ribonuclease 
hydrolysis,  might  explain  these  discrepancies.  Considering  the  complexities  of  sizing 
protected  fragments,  results  from  primer  extension  experiments  were  thus  used  in  promoter 
sequence  alignments  in  this  investigation. 

On  the  other  hand,  experiment  artifacts  can  be  obtained  by  primer  extension 
through  unstable  template-enzyme  complexes  influenced  by  secondary  structure  or 
sequences  that  inhibit  stability,  resulting  in  artificial  5'  transcript  termini.  Moreover, 
sequences  5'  to  some  plant  mitochondrial  genes  may  share  sequence  similarities  (Pring  et 
al.  1992),  and  thus  an  oligonucleotide  may  anneal  to  sequences  5'  to  other  genes.  Taking 
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these  possibilities  into  considerations,  most  primer  extension  experiments  in  this 
investigation  were  performed  with  more  than  one  oligonucleotide  to  confirm  each  result. 

The  Transcription  Initiation  Regions  of  atp6-l  and  atp&l 

Consistent  results  of  mapping  5'  transcript  termini  of  sorghum  atp6-l  were 
obtained  by  ribonuclease  protection  and  primer  extension  experiments,  as  exemplified  by 
the  number  and  the  positions  of  the  termini  identified.  The  two  5'  transcript  termini  of 
atp6-l  were  assigned  to  the  corresponding  DNA  sequence  of  -441  and  -437  by  in  vitro 
capping  and  ribonuclease  protection,  and  to  residues  of  -442  and  -439  by  primer  extension. 
Based  on  the  concordance  of  the  results  by  two  different  approaches  of  protection  and 
primer  extension,  transcription  of  atp6-l  was  assumed  to  initiate  at  two  initiation  sites  of 
-442  and  -439.  The  tentative  3'  transcript  terminus  of  atp6-l  is  at  +1 103  (D.  R.  Pring  and 
F.  Kempken,  unpublished  data),  and  thus  initiation  of  transcription  at  -442  and  -439 
generates  transcript  of  about  1540  nt,  which  is  concordant  with  the  result  obtained  from 
Northern  analysis  (Mullen  et  al.  1992).  Analysis  of  sequences  surrounding  the 
transcription  initiation  sites  at  -442  and  -439  revealed  that  the  atp6-l  promoter  lacks  a 
YRTA  core  element.  Instead,  a CTTA  element  was  found,  with  initiation  at  +5  and  +8 
relative  to  the  CTTA. 

Two  protected  termini  were  detected  for  atp6-2  by  ribonuclease  protection 
experiments.  Primer  extension  analyses  assigned  the  5’  terminus  of  the  atp6-2  transcript  to 
-188,  concordant  with  the  result  of  the  long  protected  fragment.  Since  primer  extension 
detected  only  one  terminus,  in  contrast  of  two  termini  identified  by  protection,  the  5' 
transcript  terminus  of  atp6-2  was  tentatively  assigned  to  the  terminus  of  the  long  protected 
fragment,  which  is  the  identical  terminus  detected  by  primer  extension.  The  short  terminus 
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might  result  from  ribonuclease  digestion  extending  into  3’  terminus  of  the  protected 
fragment. 

DNA  sequence  comparisons  showed  that  the  transcription  initiation  region  of 
sorghum  atp6-2  is  highly  similar  to  a maize  atp6  promoter  (Rapp  and  Stem  1992), 
including  CGTA  element.  A sequence  beginning  36  bp  5'  to  atp6-2  is  identical  to 
chloroplast  rm23  (Pring  et  al.  1993).  The  mitochondrial  transcriptional  apparatus 
apparently  recognized  the  sequence  3'  to  the  chloroplast  rm23  and  can  initiate  transcription 
of  atp6-2.  This  suggests  that  the  36  bp  immediately  upstream  of  the  initiation  site  may  be 
sufficient  for  the  RNA  polymerase  recognition.  It  may  also  be  argued  that  the  chloroplast 
sequence  retained  sequence  element(s)  that  could  be  recognized  by  mitochondrial 
transcription  complex. 


The  Transcription  Initiation  Region  of  atp9 

Transcription  of  atp9  is  characterized  by  a single  transcript  of  650  nt  (Salazar, 

1991).  Since  a minimum  transcript  length  of  636  nt  is  required  to  cover  both  tmfM  and 
atp9  to  the  edited  stop  codon,  the  650  nt  transcript  is  assumed  to  be  the  mature  transcript  of 
atp9  (Salazar,  1991).  Given  that  all  plant  mitochondrial  tmfM  identified  are 
mitochondrion-encoded  (Marechal-Drouard  et  al.  1993),  and  there  is  only  one  copy  of 
tmfM  in  the  Tx398  genome,  this  copy  of  tmfM  is  expected  to  be  a functional  gene. 

The  assigned  primer  extension  termini  and  the  termini  suitable  for 
guanylyltransferase  of  atp9  represent  a novel  observation.  In  other  similar  systems  with 
tmfM  present  5'  to  a protein  coding  gene,  excision  of  tmfM  has  been  postulated  to  be  a 
precise  endonucleolytic  cleavage  at  the  3'  terminus  of  tmfM,  generating  the  mature  3'  end 
of  tmfM  and  a 5'  terminus  of  the  3'  transcript  (Singh  and  Brown  1991).  The  multiple 
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termini  of  sorghum  atp9  primer  extension  may  represent  multiple  endonucleolytic  cleavages 
producing  multiple  5'  termini  of  the  atp9  transcripts.  Alternatively,  multiple  transcript 
termini  may  result  from  primer  extension  artifacts,  as  described  previously.  Interactions  of 
the  atp9  transcripts  and  the  AMV  reverse  transcriptase  may  be  influenced  by  secondary 
structures  of  the  atp9  transcripts  and  lead  to  unstable  template-enzyme  complex.  This 
unstable  complex  may  result  in  the  dissociation  of  the  reverse  transcriptase  from  the 
template-enzyme  complex. 

Surprisingly,  some  of  the  atp9  transcript  termini  were  substrates  for 
guanylyltransferase,  with  a 5'  sequence  5'- 

7AATC  ACGTC  AG  AT  ACTTGTACG  AGTCCT  G ATC  A-3 ' , bearing  a YRTA  core  motif 
(TGTA,  boldface.  The  3'  end  of  rm  fM  is  shown  as  the  italicized  T).  The  initiation  sites 
(single-underlined  nucleotides,  plus  the  C residue  adjacent  to  the  TGTA  core)  were 
assigned  to  nucleotides  of  -309,  -305  and  -302.  Given  the  complex  pattern  of  the  atp9 
primer  extension,  only  approximate  assignment,  rather  than  unambiguous  positioning,  of 
the  5’  initiated  termini  to  the  corresponding  primer  extension  termini  was  possible.  Since 
all  transcription  initiation  sites  identified  for  plant  mtDNA  were  positioned  3'  to  the  core 
YRTA  block  (Hanic-Joyce  and  Gray  1991,  Rapp  and  Stem  1992),  the  transcription 
initiation  sites  of  atp9  were  thus  assigned  to  residues  of  -302,  -296  or  -295,  and  -291 
(double-underlined  residues  of  above  sequence)  with  the  initiation  sites  at  +1,  +7  or  +8, 
and  +12,  agreeing  with  primer  extension  data. 

Assuming  the  termini  suitable  for  guanylyltransferase  were  primary  transcript  ends, 
the  novel  observation  of  initiation  of  transcription  at  a region  3'  to  rmfM  suggests  that 
sequences  5'  to  the  initiation  site,  as  few  as  about  21  bp,  may  be  sufficient  for  the 
recognition  of  RNA  polymerase,  with  certain  3'  sequence  elements.  Alternatively,  apart 
from  the  21  bp  DNA  segment,  some  5'  sequences  corresponding  to  part  of  rmfM  coding 
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region  may  be  involved  as  a promoter  of  atp9  and  play  a role  in  the  initiation  of 
transcription. 

Transcript  termini  of  sorghum  mitochondrial  atp9  may  therefore  be  characterized  by 
two  origins:  processing  of  the  precursor  at  the  a region  3'  to  irnfM,  resulting  in  5' 
processed  transcript  termini;  and  uniquely,  initiation  of  transcription  3'  to  the  rmfM, 
generating  5'  primary  atp9  transcripts. 

The  observation  of  unusual  processing  generating  substrates  suitable  for 
guanylyltransferase  revealed  the  possibility  that  capped  termini  may  not  be  necessarily 
associated  with  initiation  termini  in  some  circumstances  (L'Abbe  et  al.  1990).  As  a result, 
the  putative  transcription  initiation  region  of  atp9  identified  in  this  investigation  need  to  be 
examined  in  an  in  vitro  transcription  system. 

The  Transcription  Initiation  Regions  of  orfl07 

Transcripts  of  orfl07  in  A3Tx398  include  species  of  about  1 100,  870,  and  810  nt 
(Tang  et  al.,  in  preparation),  and  initiation  of  transcription  of  orfl07  was  determined  to 
occur  at  residues  of  -537,  -533,  -352,  -348,  -292  and  -288,  5'  to  the  start  codon. 
Compared  with  the  data  obtained  by  primer  extension,  which  includes  termini  at  -537, 
-358,  -296,  -292,  -268,  -237  and  -233,  transcript  termini  at  -268,  -237  and  -233  were 
assigned  to  the  processed  transcripts  while  other  termini  corresponded  to  the  initiated  ends. 
The  distal  terminus,  at  -537  by  primer  extension,  is  correlated  with  the  two  protected 
fragments  at  -537  and  -533.  The  middle  terminus,  at  -358  determined  by  primer  extension 
corresponds  to  protection  fragments  at  -352  and  -348.  The  two  proximal  termini  at  -296 
and  -292  by  primer  extension  are  in  agreement  with  two  termini  of  -292  and  -288  by 
protection,  and  thus  initiation  at  sites  -296  and  -292  is  proposed.  Taken  together,  three 
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initiation  regions-  the  distal,  middle  and  proximal  regions-were  identified  for  orfl07 
transcripts,  with  single  initiation  sites  at  -537  and  -358  for  the  distal  and  middle  region, 
respectively,  and  two  initiation  sites  at  -296  and  -292  for  the  proximal  initiation  region. 

Alignments  of  the  sequences  surrounding  the  initiation  sites  indicate  that  the  middle 
region  is  in  good  agreement  with  the  YRTA  (TATA)  motif  with  initiation  at  +3  relative  to 
the  TATA  element.  The  other  two  initiation  regions,  however,  do  not  fit  the  YRTA 
consensus.  Instead,  they  contain  a CTTA  or  AATA  with  initiation  at  +1  and  +2 
respectively. 


The  Transcription  Initiation  Region  of  urf209 

A single  protected  fragment  was  detected  by  ribonuclease  protection  corresponding 
to  a single  terminus  identified  by  primer  extension  at  -345  (Tang  et  al.  submitted).  There  is 
a 14  bp  difference  between  the  data  of  -33 1 (Figure  17)  of  protection  and  -345  by  primer 
extension.  This  difference  was  the  largest  discrepancy  among  examples  described,  and 
may  have  resulted  from  possible  ribonuclease  digestion  extending  into  the  3'  terminus  of 
the  protected  fragment.  The  data  do  suggest  that  a single  terminus  is  associated  with  a 
single  initiation  site,  which  was  assigned  to  the  terminus  of  1050  nt  transcript  of  urf209. 
The  primer  extension  data  was  thus  used  for  sequences  comparisons.  Alignments  of 
sequences  surrounding  the  initiation  site,  assigned  to  -345,  revealed  that  the  initiation 
region  lacked  a YRTA  motif,  and  instead,  a AATA  element  was  present,  as  shown  for  the 
proximal  initiation  region  of  orfl07. 
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The  Transcription  Initiation  Regions  of  orf265 

Two  transcription  initiation  regions  were  identified  for  orf265,  which  are  about  400 
bp  apart,  consistent  with  the  analysis  of  steady  state  transcripts  of  3500  and  3 100  nt  (Tang 
et  al.  submitted).  The  transcript  terminus  at  -1 16  (underlined  nucleotide),  5’- 
ACTCTAA4ATTATCATAGAGAAAGA-3',  is  within  a maize-like  sequence,  and  resides 
in  a promoter  region  (the  proximal  promoter)  for  T-urfl3  in  maize  (Kennell  and  Pring 
1989).  Comparing  the  sequences  surrounding  the  initiation  site  of  orf265  in  sorghum  and 
T -urfl3  in  maize,  some  base  substitutions  were  found  (italicized  nucleotides).  The 
substitutions  at  -2,  and  -6  to  -8,  relative  to  the  YRTA  core,  fall  into  the  N regions  of  a 
consensus,  A_NNYNRYRTAT,  derived  for  maize  atpl  and  other  maize  promoters  (Rapp 
et  al.  1993).  Consequently,  the  substituted  DNA  segment  still  retained  a promoter 
function. 

A sequence  of  288  bp  5'  to  sorghum  orf265  is  95%  identical  to  a maize  repeated 
sequence  present  5'  to  maize  T-urfl3  and  atp6  (Tang  et  al.  submitted).  Sequences  5'  to  the 
recombination  break  point  of  orf265,  5'- 

GAAACA  ATGGAATGCAACGCAAC  A AGA  GCGTA  TGTTAAGTTCTATAGT-3 ' 

(the  sequences  shared  by  maize  and  sorghum  are  in  italics)  share  some  similarities  to  the 
maize  sequence,  5'- 

ATT  GAT  GGGT  A AT  AGTTTT  GAT  AG  AAA  ACT  CGTA  TGTTAAGTTCTCTA  GT-  3 ' 

This  maize  sequence  includes  the  middle  promoter  of  T-urfl3  and  atp6  in  maize  (deduced 
from  Rapp  and  Stem  1992;  and  Dewey  et  al.  1986).  The  corresponding  orf25  sequence  is 
not  active  in  transcription  initiation,  although  the  sorghum  DNA  region  includes  CRTA 
(boldface)  from  the  maize-like  sequence  and  repeats  of  GAA  or  CAA  (underlined)  present 
5’  to  the  break  point.  This  suggests  other  5'  domains  in  addition  to  the  core  CRTA  may  be 


94 


required  for  the  promoter  function.  Another  possibility  that  cannot  be  ruled  out  is  that  an 
adenosine  substitution  of  cytosine  at  +12  (not  shown)  relative  to  the  YRTA  core  motif 
might  abolish  promoter  function,  since  sequences  3'  to  the  1 1 nt  sequence  have  not  been 
tested  for  effects  on  promoter  function  (Rapp  and  Stem  1992). 

A distal  initiation  region  was  also  identified  for  orf265,  with  a sequence  5'- 
GAATT AACTCGT ACATAA-3',  surrounding  the  initiation  site  -554.  This  sequence  block 
includes  a perfect  CRTA  and  upstream  GAA  repeat,  with  initiation  at  +4,  concordant  with 
proposed  promoter  motifs  for  other  species  (see  below). 

In  vitro  Transcription  of  atp6-2 

Among  the  nine  regions  of  transcription  initiation  identified  in  this  study,  a 
sequence,  5-GAAAACTCGTATAATAG-3',  covering  the  initiation  site  (underlined)  of 
atp6-2  is  highly  similar  to  the  maize  atp6  promoter,  and  thus  a DNA  fragment  bearing  the 
atp6-2  transcription  initiation  region  was  tested  in  a heterologous  in  vitro  transcription 
system  with  maize  extract.  Partially  purified  RNA  polymerase  from  maize  mitochondria 
recognized  the  promoter  of  sorghum  atp6-2,  and  initiated  at  a position  consistent  with  the 
-188  site  assigned  from  primer  extension,  and  the  longest  protected  fragment  from  in  vitro 
capping  and  protection  experiments  (Figure  22).  Heterologous  in  vitro  transcription 
studies  have  been  conducted  in  animal  and  fungi  systems  (Clayton  1991,  Costanzo  and 
Fox  1990),  and  some  compatible  or  incompatible  interactions  have  been  revealed  (Clayton 
1991,  Costanzo  and  Fox  1990).  No  such  studies  have  been  reported  in  plant 
mitochondrial  systems  to  date.  That  the  maize  RNA  polymerase  recognizes  a sorghum 
promoter  suggests  the  transcription  machinery  of  the  two  genera  may  be  related. 
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Consensus  Motif  of  Sorghum  mtDNA  Promoters 

In  an  effort  to  ascertain  sequence  similarities  between  different  initiation  regions  of 
the  sorghum  mitochondrial  genes  and  open  reading  frames,  sequences  surrounding  the 
initiation  sites  were  compared  and  aligned  (Figure  23).  A consensus  block  of  YRTA  was 
apparent  for  five  of  the  nine  representatives,  indicating  some  functions  of  the  conserved 
motif  in  promoting  transcription  initiation  of  sorghum  mtDNA,  as  described  in  other 
systems  (Mulligan  et  al.  1991,  Rapp  and  Stem  1992,  Rapp  et  al.  1993).  All  mitochondrial 
promoters  analyzed  to  date,  including  those  of  animals,  yeast  and  plants,  with  one  exception 
of  the  HSP  in  mouse  (Chang  et  al.  1986b),  are  characterized  by  a DNA  segment  that  covers 
the  transcription  initiation  site  (Biswas  et  al.  1990,  Clayton  1991,  Rapp  and  Stem  1992). 
Furthermore,  the  corresponding  elements  identified  in  maize  by  mapping  of  5'  transcript 
termini  in  vivo  (Mulligan  et  al.  1991)  have  been  demonstrated  to  be  indeed  critical  for 
promoting  transcription  initiation  in  vitro  (Rapp  and  Stem  1992,  Rapp  et  al.  1993),  as  well 
as  for  protein  binding  (Newton  et  al.  1995).  Consequently,  the  sequence  elements 
described  here  are  reasonably  referred  to  as  part  of  the  putative  promoters  of  sorghum 
mtDNA,  although  definitive  characterization  awaits  the  examination  of  these  DNA  elements 
in  an  in  vitro  transcription  system. 

The  sorghum  mitochondrial  putative  promoter  sequences  were  aligned  relative  to  the 
YRTA  core  of  Rapp  and  Stem  (1992)  and  Rapp  et  al.  (1993).  Five  of  the  nine  promoters 
bear  the  YRTA  element,  including  promoters  of  obligate  genes  atp6-2  and  atp9.  The  four 
promoters  that  do  not  fit  YRTA  include  the  promoter  of  urf209  and  the  proximal  promoter 
of  orfl07,  which  exhibit  an  AATA  (ARTA)  motif.  The  promoter  of  atp6-l  and  the  distal 
promoter  of  orfl07 , which  display  a CTTA  motif,  also  deviate  from  the  consensus 
sequence.  The  initiation  sites  range  from  +1  to  +5  for  most  of  the  examples,  with  the 
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Zraatpl 
ZMatp6/T- 
ZMatp6/T- 
ZMatp6/T- 
Sbatp6-2 : 
Sb or f 107: 


Stour f 2 09 
Stoorf265 

Stoat p9 : 
Stoatp6-1 


-29  -20  -10 

TICTTATIAT1AAGCCGAAAACTAA 
-urfl3A  CTCTAAGCATCTIOXATGAAT^ 
-urfl3B  ToGCTAATACTTIlGATAGAAAACT 

TATCATCTITGCTCCTCA^ 

a)  TITTAATCTATrATIC^ 

b)  GCGGACGGCCGCAATCCCAGACAAA 

c ) TCAAATICGATCAGGCAAAAAAATC 
CTAGATCCTATITCTAGAATAT^ 

a)  GAAAATAGGCACQCACGAATIAACT 

b)  ATCTCCCTiTlGACTCTrAAAATTAT 
GCCCnOCCTAATCACGnXAGATACT 
TCGACTCTACCTCTACCTGA^^^ 


-1+1  5 10  14  ST 

CCTA  CTAAAAAACAAACT  +1 
CCTA  TATAAGCCTACTTA  +4 
CCTA  TCTTAACTTUICT  +3 
CATA  GAGAAAGATCTICT  +5 
CCTA  TAATAGTCTCATIG  +5 
CTTA  AATAAT3CTCTGAG  +2 
TATA  TTAATAAAGCATCC  +3 
AATA  CTAGATAATCAAAG  +1, 
AATA  AGITTAGCTIATTT  +2 
CCTA  CATAACATAAAAAG  +4 
CATA  GAGAAAGATICTICT  +2 
TCTA  CGAGTCCIGA1CAC  1, 
CTTA  CAATGGAGATCTAC  +5, 


+5 


7/8, 

+8 


12 


maize  consensus: 
wheat  consensus: 
consensus  for  dicots: 
yeast  consensus: 


A/TYRTAG/TAA/TAAA 

RAaaNNCCRTAtARtRagt 

RAaAtNtCRTAaGaGaGAa^SIaaR 

ATATAACTA 


Figure  23.  Sequences  surrounding  the  transcription  initiation  sites  of  selected 
sorghum  mtDNAs  and  the  promoter  consensus  of  mtDNAs  for  other  species. 
Transcription  initiation  sites  are  underlined.  The  conserved  YRTA  sequence  is 
underlined.  ST  stands  for  the  site(s)  of  transcription,  relative  to  the  YRTA. 

R indicates  purine,  and  Y stands  for  pyrimidine.  Zmatpl  represents  the 
promoter  for  maize  atpl  (Rapp  and  Stem  1992).  Zmatp6/T-urf13A,  Zmatp6/T- 
urf13B,  and  Zmatp6/T-urf13C  represents  the  distal,  middle  and  proximal 
promoter  of  atp6  and  T-urf13  in  maize,  respectively  (Kennell  et  al.  1989,  Rapp 
and  Stern  1992).  Sborf107a,  Sborf107b  and  Sborf107c  indicates  the  distal, 
middle  and  proximal  initiation  region  of  sorghum  orf107,  respectively. 
Sborf265a  and  Sborf265b  represents  the  distal  and  the  proximal  initiation 
region  of  sorghum  orf265,  respectively. 
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second  atp6-l  site  at  +8,  and  the  second  and  third  sites  of  atp9  at  +7  and  +8.  Assignment  of 
the  core  of  the  proximal  promoter  of  orf!07  could  be  moved  three  bases  3'  to  the  AATA 
(ART A),  yielding  an  AGTA  (ART A),  but  one  of  the  two  initiation  sites  would  be  assigned 
to  the  G residue  of  internal  AGTA  block,  which  is  inconsistent  with  other  proposed 
positioning  relative  to  the  core  (Rapp  and  Stem  1992,  Rapp  et  al.  1993). 

The  core  domain  of  YRTA  has  been  found  to  be  necessary,  but  insufficient  to 
promote  transcription  of  cloned  fragment  by  in  vitro  analysis  of  the  maize  atpl  promoter 
(Rapp  and  Stem  1992).  Other  sequence  element(s),  such  as  the  repeated  elements  G(A)3-4 
or  C(A)3-4,  present  just  upstream  of  the  transcription  initiation  site,  as  found  in  several 
plant  systems,  may  be  required  for  specific  transcription  initiation.  Although  the  guanine 
preceding  (A)n  has  been  shown  to  not  be  essential  in  vitro  for  maize  atpl  promoter  function 
as  demonstrated  by  mutating  the  GAAAA  element  to  TAAAA  (Rapp  and  Stem  1992),  the 
GAAAA  could  play  a role  in  vivo  for  promoting  transcription  initiation  or  the  critical 
elements  may  reside  within  the  AAAA  domain.  This  upstream  domain  was  positioned 
approximately  10  bp  5'  to  the  core  motif,  corresponding  to  one  helical  turn  (Rapp  et  al. 
1993).  The  nine  proposed  promoters  described  here  each  exhibits  elements  of  this  domain, 
ranging  from  (G/T)AAAA  to  GA,  and  the  domain  is  centered  around  -10,  relative  to  the  core 
YRTA. 

The  two-domain-model-one  core  motif,  one  upstream  element-has  been  proposed 
for  the  maize  atpl  promoter  (Rapp  et  al.  1993).  Based  on  the  data  of  the  nine  putative 
promoters  identified,  sorghum  mtDNA  promoters  are  expected  to  bear  similar  two  domains. 
Inspection  of  sequences  between  the  two  domains  of  the  nine  sorghum  promoters  as  well  as 
promoters  of  maize  and  wheat  revealed  that  sequences  between  the  two  domains  lack 
nucleotide  similarity,  but  rather  a pyrimidine  region  with  one  to  several  nucleotides  of  C 
or/and  T is  always  present.  Conversely,  a DNA  fragment  with  two  perfect  domains  and  a 
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pyrimidine  region  in  between  may  not  be  a promoter,  since  other  factors  may  be  involved  in 
initiation  transcription,  in  addition  to  the  putative  necessary  elements.  This  circumstance  is 
represented  at  the  recombinational  break  point  of  the  288  bp  sorghum-maize  region  in 
sorghum  orf265.  Sequences  surrounding  the  break  point  resemble  the  middle  promoter  of 
T-urflS  in  maize  (deduced  from  Rapp  and  Stem  1992  and  Dewey  et  al.  1986),  including  a 
CGTA  core  motif  residing  in  the  maize-like  sequence  and  a upstream  domain  of  CAA  and 
GAA  from  sorghum  sequence  region,  but  no  pyrimidine  in  between.  This  chimeric  region 
surrounding  the  break  point  has  been  shown  to  not  be  active  as  a promoter  for  orf265. 
Nevertheless,  a possible  role  of  this  pyrimidine  region,  based  on  inspection,  can  only  be 
concluded  by  testing  in  an  in  vitro  transcription  system. 

In  contrast  to  the  systems  in  Xenopus  laevis  (Bogenhagen  and  Romanelli  1988), 
Neurospora  crassa  (Kubelik  et  al.  1990),  yeast  (Marczynski  et  al.  1989)  and  T7 
bacteriophage  (Klement  et  al.  1990),  in  which  transcription  initiates  at  a single  position, 
multiple  initiation  sites  at  one  promoter  may  exist  for  plant  mitochondrial  DNA  (Rapp  and 
Stem  1992).  Closely-spaced  multiple  initiation  sites  were  also  found  for  the  sorghum 
promoters.  For  instance,  the  promoter  of  atp6-l  and  the  proximal  promoter  of  orfl07,  and 
probably  the  atp9  promoter,  bear  two  initiation  sites. 

Among  the  nine  putative  promoters  identified  through  this  investigation,  four 
promoters  deviate  from  the  consensus  core  motif,  as  observed  in  other  species  (Binder  and 
Brennicke  1994,  Lizama  et  al.  1994,  Mulligan  et  al.  1991,  Newton  et  al.  1995).  These 
sequence-diverged  promoters  have  been  proposed  to  be  weak  promoters  based  on  the  run- 
on  experiments  (Mulligan  et  al.  1991).  Data  from  the  present  study  do  not  allow  a 
conclusion  regarding  promoter  strength  between  the  four  diverged  promoters  versus  the  five 
conserved  promoters,  without  performing  run-on  experiments. 
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Comparative  analyses  of  the  middle  and  the  proximal  promoters  of  orfl07  with  one 
protected  fragment  showed  that  the  abundance  of  the  protected  fragment  for  the  middle 
promoter  was  less  abundant  than  that  of  the  proximal  promoter  (Figure  14),  although  the 
core  region  of  the  middle  promoter  bears  a YRTA  (TATA)  while  the  proximal  promoter 
does  not.  Transcripts  corresponding  to  these  promoters  do  not  illustrate  convincing 
differences  in  abundance  (Tang  et  al.  in  preparation).  This  suggests  that  there  may  not  be 
necessary  correlations  between  the  abundance  of  the  capped  transcript  and  the  abundance  of 
steady  state  transcripts. 

The  lack  of  similarity  of  some  plant  mitochondrial  promoters  might  be  explained  if 
plant  mitochondrial  transcription  complex  occasionally  employs  a scanning  mechanism  for 
transcription  initiation  so  that  the  polymerase  initiates  at  sites  that  possess  only  limited 
identity  to  the  polymerase  or  factor  binding  motif  (Brown  et  al.  1991).  Alternatively, 
transcription  initiation  may  involve  more  than  one  type  of  promoter.  For  instance,  not  all 
cappable  termini  that  have  the  YRTA  motif  are  suitable  sites  for  in  vitro  RNA  polymerase 
initiation,  as  demonstrated  by  in  vitro  analysis  of  the  rml8  promoter  (Rapp  and  Stem 
1992).  The  maize  rml8  promoter  fits  the  proposed  consensus  perfectly,  but  was  not 
recognized  by  the  maize  extract  that  correctly  promotes  transcription  of  protein-coding  genes 
(Rapp  and  Stem  1992).  Thus,  a complex  transcription  machinery  may  exist  for  plant 
mtDNA. 

Overall,  this  investigation  has  documented  nine  5'  primary  transcript  termini  of 
sorghum  mtDNA  by  demonstrating  that  each  is  a suitable  substrate  for  guanylyltransferase. 
The  defined  sequence  similarities  located  5'  to  the  transcriptional  initiation  sites,  revealed  by 
sequence  comparisons,  allow  the  identification  of  nine  putative  sorghum  mitochondrial 
promoters.  These  nine  promoters  exhibit  a consensus  of  two  domains  as  proposed  for  the 
maize  atpl  promoter:  the  core  motif  and  the  -10  bp  upstream  domain.  Five  of  the  nine 
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promoters  have  a conserved  core  motif  of  YRTA  as  identified  in  other  plant  species 
(Mulligan  et  al.,  1988a,  1988b,  Covello  et  al.,  1991,  Hanic-Joyce  and  Gray,  1991,  Brown 
et  al.,  1991,  Rapp  and  Stem,  1992,  Binder  & Brennicke,  1993),  while  the  other  four  bear 
only  limited  conserved  motif  of  ARTA  or  YTTA.  These  observations  suggest  that  plant 
mitochondrial  promoters  may  indeed  be  characterized  by  a certain  degree  of  flexibility  of 
promoter  sequences,  concordant  with  complex  transcription  machinery. 

Substantiation  of  the  capability  of  the  putative  promoters  to  function  in  transcription 
initiation  would  be  obtained  through  introduction  of  the  elements  in  a transient  gene 
expression  system,  in  vitro  transcription  assays,  or  ultimately,  into  a transformation 
system.  The  development  of  in  vitro  RNA  polymerase  assays  in  wheat  (Hanic-Joyce  and 
Gray  1991)  and  maize  (Rapp  and  Stem  1992)  allows  an  assessment  of  the  putative 
sorghum  mitochondrial  promoter.  The  sorghum  atp6-2  template  was  a suitable  template  for 
the  partially  purified  maize  RNA  polymerase.  The  initiation  at  the  site  predicted  by  primer 
extension  and  in  vitro  capping  suggests  that  the  identified  site  is  indeed  a promoter. 
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